Monarch geneset OGS2.0

DPOGS202454
TranscriptDPOGS202454-TA1131 bp
ProteinDPOGS202454-PA376 aa
Genomic positionDPSCF300174 + 38802-44895
RNAseq coverage67x (Rank: top 67%)
Annotation
HeliconiusHMEL0173047e-7150.79% 
BombyxBGIBMGA009953-TA5e-7152.76% 
DrosophilaSpn4-PB1e-2831.06% 
EBI UniRef50UniRef50_P229224e-7152.67%Antitrypsin n=73 Tax=Ditrysia RepID=A1AT_BOMMO
NCBI RefSeqNP_001166849.17e-7252.67%antitrypsin isoform 2 precursor [Bombyx mori]
NCBI nr blastpgi|13781322e-8057.63%serpin 1 [Manduca sexta]
NCBI nr blastxgi|13781303e-7857.63%serpin 1 [Manduca sexta]
Group
Gene OntologyGO:00048677.1e-61serine-type endopeptidase inhibitor activity
KEGG pathwaydgr:Dgri_GH198621e-33 
 K13963 (SERPINB)maps-> Amoebiasis
InterPro domain[108-376] IPR0237965e-61Serpin domain
[141-371] IPR0002157.1e-61Protease inhibitor I4, serpin
Orthology groupMCL10132 Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202454-TA
ATGGTAGTGACCGGAAAAGGAAACAGACTGCCACGTTGTGTGGTGGACGTTACCGTCAGCTCACCTGACTTGCTAGGCAAGGTGGGAGAGAAGAGGGTGACCGGAAAAGCCACTTTAAGCAAAGGAGGGAAGGATGAACCAGACAACATCAAGAAGGAATACAGTGCGATTGTGGTTGAGGTATGTGAGGAGGTTCTCCTAAAACAAGGTGTAGGAACCAACCTGCAATTGCTTGGTTACCAAAGAACTGGAGGAAATAAAAGGAGAAACTATTCACCATCGAATCACCGAATCTACTGTGCGGCAGACTGTAGAAAAGTTATGGAAGATAACTTAAACAAAAAGCACATATTTTTCTTTGCTATCACGGCTATGGCTAGCGAAAAGACTTTAAATGAAATGCTTTTTAATAGCAATACCCAATTCACAACAAAAATGTTTAAAGAAGTAGTAAAAGCCAAACCAGGACAAAGTGTAGTGCTATCAGCTTTTTCGGTTCTGCCACCTCTTGCTCACCTTGCTTTAGCATCTGTTGGGGAATCACACGATGAACTTCTTGATGTAATTGAAATGCCAAATGACAACGTTACTAAAGCAGTATTTTCAAAAGCAAACACCGTTTTGAGATTAGTAAAAGGAGTGACTCTTAAAATGGCAAGCAAAGTCTACGTGGCTGAGAATTATGCATTAAACAGGGACTTTGCCGCCCTTAGTCAAGATGTTTTTGGATCTGAAGTTGAAAATATCGATTTTTCTGAAAACGAAAATGCCTCTAAGAAAATCAATCAATGGGTTGAAGATGAAACAAATAATCGAATTAAAGATCTAGTAGACCCCACATCCCTAGATGCTGATACCAAAGCTGGTGCATGGAAAACTCCTTTTGACAAGAAAAGCACAACCGACAGAGATTTCCATGTGAGCAAAGAAAATGTTGTCAAAGTACCCACCATGTACAATTCAGACACCTTTTATTACATCGATAGCGAGGAACTCGACGCACAGGTATTGGAACTTAAATATGAAGGAGAAGATTCTGCTCTGTATGTTGTCCTTCCACATGATGTAGATGGAATTAACAAATTGAAAGAAAAGCTCAGAGACCCATCAATTTTTTTCAATTTAATTTAG

Protein sequence:

>DPOGS202454-PA
MVVTGKGNRLPRCVVDVTVSSPDLLGKVGEKRVTGKATLSKGGKDEPDNIKKEYSAIVVEVCEEVLLKQGVGTNLQLLGYQRTGGNKRRNYSPSNHRIYCAADCRKVMEDNLNKKHIFFFAITAMASEKTLNEMLFNSNTQFTTKMFKEVVKAKPGQSVVLSAFSVLPPLAHLALASVGESHDELLDVIEMPNDNVTKAVFSKANTVLRLVKGVTLKMASKVYVAENYALNRDFAALSQDVFGSEVENIDFSENENASKKINQWVEDETNNRIKDLVDPTSLDADTKAGAWKTPFDKKSTTDRDFHVSKENVVKVPTMYNSDTFYYIDSEELDAQVLELKYEGEDSALYVVLPHDVDGINKLKEKLRDPSIFFNLI-