Monarch geneset OGS2.0

DPOGS206012
TranscriptDPOGS206012-TA5148 bp
ProteinDPOGS206012-PA1715 aa
Genomic positionDPSCF300253 + 87829-112192
RNAseq coverage83x (Rank: top 64%)
Annotation
HeliconiusHMEL0115490.050.29% 
BombyxBGIBMGA012631-TA0.044.36% 
DrosophilaTepII-PA3e-5423.41% 
EBI UniRef50UniRef50_D6BMV21e-8624.48%Alpha2 macroglobulin isoform 2 n=1 Tax=Fenneropenaeus chinensis RepID=D6BMV2_FENCH
NCBI RefSeqXP_392454.25e-7127.10%PREDICTED: similar to alpha-2-macroglobulin-like 1 [Apis mellifera]
NCBI nr blastpgi|2557397454e-8624.48%alpha2 macroglobulin isoform 2 [Fenneropenaeus chinensis]
NCBI nr blastxgi|2700110608e-8625.28%hypothetical protein TcasGA2_TC009667 [Tribolium castaneum]
Group
Gene OntologyGO:00056151.2e-38extracellular space
GO:00048662.3e-15endopeptidase inhibitor activity
GO:00055764.6e-12extracellular region
KEGG pathwayxla:4477431e-77 
 K03910 (A2M)maps-> Complement and coagulation cascades
InterPro domain[998-1306] IPR0089302.2e-43Terpenoid cylases/protein prenyltransferase alpha-alpha toroid
[1048-1300] IPR0116261.2e-38A-macroglobulin complement component
[789-875] IPR0015992.3e-15Alpha-2-macroglobulin
[547-699] IPR0116252.5e-13Alpha-2-macroglobulin, N-terminal 2
[1369-1507] IPR0090484.6e-12Alpha-macroglobulin, receptor-binding
[203-297] IPR0028902.6e-10Alpha-2-macroglobulin, N-terminal
[1005-1033] IPR0195656.1e-09Alpha-2-macroglobulin, thiol-ester bond-forming
Orthology groupMCL11344 Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206012-TA
ATGAAAATGAAGTGGTTGTTGTTACTGTTCGTGTTACCGGCACAATTGACTGGGACGCCGCAGAATTTGACAAGCTCGCCATGCACCGACAGAAACCACATATTCCTGATGCCTGGCGCACTCACGGCTGGTGGTTCAAGTCGGGCCTGTGTCTCCCGGTTCTACACAGAGGGTTCTGCTCAAATGACACTCACATTGAATGTTGATGGAGAAACTGTGACTTCGAGTCGAGACCTTCATAGAGACGGTGGTTGTCTGGATATTTCAGTTCCACAGCGACCAAATTCCAAAGCCGATGTCTATATTAATATAAGATATCCTCAATGCGTGTGGGAGCGACATATGAAAGTTCGTGTTTCAACCGGCCGCGTGGTGATAGTACACACGGAGCGAGCTCGGTATAGACCTGGAGACTTGGTGAGGGTTAGAGCTCTCGTCGTTAAAGCTGATCTGACGCCTGCACATACTGCTGTGAGAGACGGTGGATGTCTGGATATCTCAGTTCCACAGCGACCAAATTCCAAAGCCGATGTCTATATTAATATAAGATATCCTCAATGCGTGTGGGAGCGACATATGAGAGTTCGTGTCTCAACCGGCCGCGTGGTGATAGTACATACGGAGCGAGCTCGGTATAGACCTGGAGACTTGGTGAGGGTTAGAGCTCTCGTCGTTAAAGCTGATCTGACGCCCGCACATACTGCTATTGACGAAATATGGCTGGAGGGACCTGGGGGGTGGGGGGTTAAGACAGCACAGTGGCTCAAACTACGACCTCGACTGGGACTGGTCCAGGTTCAACATCAGTTGGACGACAGCGCTCCGCCAGGGAAATGGAGAGTGAGGACTCGCCTTGCTGATGGAGCTCAGGGTTCCTCTTCCTTTTTGGTGGGTAACTACGAGCTACCACCATTTCAACTTACAGTTCGCCACTCTCCAAGAATATTAAGGACCAGCGAGAGACTTGTGTGGACCGTGTGTGTGAGGTATCCCTGGTCTGAAGCTGTGGAAGGCATGTTGGTGATACGTCTCCGCGGTGCGGGTGGTGGAGATGGTGCTGGAATCAGAACTGCTGTCCGTCTGAAGGCGCCTCGAGCATGCCACAGACACGCCGCAGCTGCCAAACGAATAGGACTGAACGGTGACAACCCTCCTGATGTGGTTGTCGCCGACTTTTCCTTTGAAGAGGAAGGCACCGGTGTGTGGCAGAATACTACAGTGGTGTCCCAAGTCGTTGATGAAGCGGTTACTTTGGAGTTCCTCACTAAACATCGCGTCATCATATCACCCGGGCTGCCTCATAAGATTAAAATAAAGGCTACTCGTTGGGACAACAAGCCTGCTGCTGGCGAGCGAGTGAACGTATGCCGGTCTGCATCTTCCATTATTGCATCCGAATTTAATTCCACTGAAGCTATTTGTGTGAACGCTACCACAGACGAGAAGGGAATAGCAAGAGTCATGTTCAATGCCGACAATAGTCCATATTACAACTTACAGGCCAGTATTAACAACACTCGAACGACCACTCAAGTCCTCGTAGTTCGATCTGGCCGAGCAGCCCTCGGAGCCCTTCGCTCAGAACAGCACGGTGCTGCCTTATTGCCGCTGTACATAGACCTGAAAGTCGTCGCACCGCTTACGGTGCATTTCGTGGTCATCACTCGCGGCGGCATCATCTTTCGATGGGGAGCGACAACTCAATGTCCCATAACAAGTCCAACAGACAAAATAATAACGTCCCCGAGAAATAGCATCTGCCCTAACACAAACCCATATTCAATTGATAAGATCCTTAATAACAACCTGGAGTCCAACTCCACCGAACTGGAGACGCTTCTAGATAACTATTTATCTAAAGTAATGTTACCAATAAAAGTCAGCCCTCAAATGTGTCCGGAAAGTCATCTCCTAGCGTACTTCTACCACAATGATGAACTTATCACGGCCAGCAAACATTTTGAAATGGAAGACTGCTTCGTCAGAAAGGTGGATGCTTCGTGGTCACCACGTTTGGTTGCACCTGGTTCTCTGGCAACGCTGCAACTCACCACTCCTGGACCAGCATTATGTGCTCTTACAGTTTTGGATACGGCTTCTAAATGGATTCAATACGAAAATATCAGGGAGCTGGTGATGAACGGCCTGCGAACGCTCATGGACAGTCACAGGAATTTGACAGAACATGATGCTGCGTGGGAGTGCTTTCTAACATCGGAGAGCCCGGTTTTGTCTACAAGTCGCGACCTCTTGAGCTGGTGGCTGGCCTCGGCAGGAGTGAGACTCCTGGGAGACCATCCATCATCCTGCGAAGCCCCAGAGCTGATGATAGATGACGTATTACCTAGAAGTGATTTCTCAGAGTCGTGGTTATGGAAGCTGGCTCCAGTCAGCTCGCGAGGTTCCTTATCTGTGACGTCCCGCGCACCTCATACAGTAACGAGATATGAAGCGACTGCTCTCTGTGTGTCTCGCGCTGGTCTAGCCATTTCTTCACCCGCCGTGCTTCAGGTGTTCCGCGAGGTGTTTATTCACGCGAGCGGTCCTCGCCGCGTGCGTCGCGGTGACGCCATTCTTGTTCCATACAGAGTTTTTAATTATCTGTACACACCACACGCAGTGGAAATCATCATAACGACGAATCACGTTGTGGATGGATTAACTCGTGAGGTTGTATGTCTGTCTGCTCGCACGTCCACCGCGCGCCGTATGATGGTCACGTGTCAAGACTCGGACCTCCTCAGCATCAGAGCGACAGGAGTGAAAGATGCCAACTGCAGTACGGACTATAGAGAGTTCAGCGATGAAGTTGTAATCCACATCCAGGTTGATCCCGAGGGTGTTCCAGTGCGGGAGATGAAGTCTGCACTGTTATGCGGAGTTGATAGTGTGAACTTCACGAGTTCGTCGGAAGTGACGTGGGACTGGTCATCGGAGAGAGCGCTGCCAGGCACCGAGTCCCTGACCGTATGGACAACCACAGACCTCATGGGACCTCTACTGGCTCATGCAGATGGATTAGTGGATCTCCCGCGAGGCTGCGGCGAACAGAATATGGCCCGCCTCGCTACCGCTCTGCTCGCATTGCGGCTTTTGGAACCTCACTCACCCGCCGCAGACGACGCGAAGGATCAAGTAGCTAGAGGGTTCACCCGTCAGCTACAGTACGCTCACGTGGGTGGGGGGTTCAGCGCCTTTGGTAAGAACGACCCTACTCCCTCCACCTGGCTCACAGCATTCAGCTTGAGATACATGAGGAAGGCGTATGAGGTCATATCAGGTTCCGGTCCGCTGCCTCCAGTGTTAGAATTGTCTCGAGACTGGTTGCTCAACCAGCAACTCGAAAATGGTTGTTTCAGTAACACTGGACACGTCTTCCATCATCTACTCAAGGGCGGTCTAGACGAGGATGGAGAAATAGCCAATGTGGCCCTCACAGCCTACGTCATCGCCTCACTCACAGAAACCTCCCTCCCTTACAAGATCCTCAACAATTCCCTCCCATGTCTTCGCGCCCTGGTTCCCATGAGGACCAAAACTAATTCAAGAGTATACGCACAAGCCTTGATAACTTACGCGTTCATGAAGTTGAGGAAATATGAGGAATTAGGTAATGATACCTTGATGGGGAGTTTGGAGGAAGATTACTTGAGAGAACTGATCGAGTTACTAAGGATCGCTAAGAGGAGTGGAGATTTTGTATGGTGGGAAACTGGTAACCTGGCTACATCCATCGAGTCCACGGGATACGCTCTCCTGTCTCTATCTGAGTGTCCGCCGAGGAGAGGCTGCGAGGTAGCCGCGGCCGGTTCGCTGAGATGGCTGGCGGCCCACAGAGGCACTTCAGGGGGATTCCTCTCAACTCAGGACACGTTAGTGGCCCTGGAGGGTATGTCTCGTCTGTCGCCGCTACCTGCTGGAGGGCTGGTGACACTACAGTCTGGAGACGACACGAGGATCGTGACTCCTACAGCCGTCCCCGAGCTGGTGACGATGAAGGTGGACCAGCTGAGAGTCACCGTTGAAGGCCCCGGATGTGCTCTAGTTCAGGCCACTCACAGTTACAACACGCTTGAGCCACACGAGTACCAACTCGAGCCGAGCTCTCTCTCCGTACACACGAACGTTCAAACCGACGGTCCCTTCGATTGTGTCAACGACGTCTGCTTCTGTGCCGCCATAGTTAAGGTGTGTGCATGGTGGCGCGGACCCCTGCCCGCTATGTCTGTTCTGTCAGTGACGATGCCCACGGGTTACACTCCGAGCGCTACACACCTCTACTCCCAACTAAACAATCAAACTCTTCTCCGTCGCGTGGAGATCTCGTCCGTCAGCAACAAGGTGAGTCTGTATCTTGGGACACGTGATGGAAGCACCAGCTCAGACGGTCAGGAGTGCTACACGCTGCACCTCGTGGGACCGAGACTTAAAACTAAACCAGCCTACGCGGAAATCATAGACTACTACCGGCCACATGTCAGGGATATCCAAATGTTCACAATACCCGAGGACTGTCCTCCGAGGATAGCTCCGGACGATTCAAATCGATACATACCCTCCGATAATTTATTCAGCAAAGCGAAGTCATTAGACGGCGAAGACGTCGTCATATCGTACGAATACACCATAGACGATCTGCCAGAAGGAATACCGCTTGAAGATCCGATATACGACAACTTGACCAGAATTAATAAGGTCGTTGATAAGAAAACAAATAAAGATGTCCGTAAAAATATATTCAATGACAAACCAACAAATAAAACTGACGCTATCAATACTGGAAATACAGAGGAATTAAAGAACCAAAACGGAAAGATTCAATTGGATTATGACAAGAGATCAGATGATCCGAAAGAAGATACATTAGGGGATCCTCAAGCCGGTGCTCATCCTCACGAGAGTGAGTCGAAGCAGTCCTTGAATCCGATGCTATCCTCCTTTCATGTTATATACAGTGACCAGGACTTGCAGGTGCCCTCGGGGATCGAAGGTCCTGTACCTGCAATAGTTCTACCGCCGCCGGACTTCCTTGCTAGGAACAGAGGTTCCAGCTGGGGAGGAAATGTCGCCGTACCGGTAAATCCACAATATCTCAGAGAAAATTATTACAGAATGAATTATCTTTACGACGCCATGAAGCGGAAATAG

Protein sequence:

>DPOGS206012-PA
MKMKWLLLLFVLPAQLTGTPQNLTSSPCTDRNHIFLMPGALTAGGSSRACVSRFYTEGSAQMTLTLNVDGETVTSSRDLHRDGGCLDISVPQRPNSKADVYINIRYPQCVWERHMKVRVSTGRVVIVHTERARYRPGDLVRVRALVVKADLTPAHTAVRDGGCLDISVPQRPNSKADVYINIRYPQCVWERHMRVRVSTGRVVIVHTERARYRPGDLVRVRALVVKADLTPAHTAIDEIWLEGPGGWGVKTAQWLKLRPRLGLVQVQHQLDDSAPPGKWRVRTRLADGAQGSSSFLVGNYELPPFQLTVRHSPRILRTSERLVWTVCVRYPWSEAVEGMLVIRLRGAGGGDGAGIRTAVRLKAPRACHRHAAAAKRIGLNGDNPPDVVVADFSFEEEGTGVWQNTTVVSQVVDEAVTLEFLTKHRVIISPGLPHKIKIKATRWDNKPAAGERVNVCRSASSIIASEFNSTEAICVNATTDEKGIARVMFNADNSPYYNLQASINNTRTTTQVLVVRSGRAALGALRSEQHGAALLPLYIDLKVVAPLTVHFVVITRGGIIFRWGATTQCPITSPTDKIITSPRNSICPNTNPYSIDKILNNNLESNSTELETLLDNYLSKVMLPIKVSPQMCPESHLLAYFYHNDELITASKHFEMEDCFVRKVDASWSPRLVAPGSLATLQLTTPGPALCALTVLDTASKWIQYENIRELVMNGLRTLMDSHRNLTEHDAAWECFLTSESPVLSTSRDLLSWWLASAGVRLLGDHPSSCEAPELMIDDVLPRSDFSESWLWKLAPVSSRGSLSVTSRAPHTVTRYEATALCVSRAGLAISSPAVLQVFREVFIHASGPRRVRRGDAILVPYRVFNYLYTPHAVEIIITTNHVVDGLTREVVCLSARTSTARRMMVTCQDSDLLSIRATGVKDANCSTDYREFSDEVVIHIQVDPEGVPVREMKSALLCGVDSVNFTSSSEVTWDWSSERALPGTESLTVWTTTDLMGPLLAHADGLVDLPRGCGEQNMARLATALLALRLLEPHSPAADDAKDQVARGFTRQLQYAHVGGGFSAFGKNDPTPSTWLTAFSLRYMRKAYEVISGSGPLPPVLELSRDWLLNQQLENGCFSNTGHVFHHLLKGGLDEDGEIANVALTAYVIASLTETSLPYKILNNSLPCLRALVPMRTKTNSRVYAQALITYAFMKLRKYEELGNDTLMGSLEEDYLRELIELLRIAKRSGDFVWWETGNLATSIESTGYALLSLSECPPRRGCEVAAAGSLRWLAAHRGTSGGFLSTQDTLVALEGMSRLSPLPAGGLVTLQSGDDTRIVTPTAVPELVTMKVDQLRVTVEGPGCALVQATHSYNTLEPHEYQLEPSSLSVHTNVQTDGPFDCVNDVCFCAAIVKVCAWWRGPLPAMSVLSVTMPTGYTPSATHLYSQLNNQTLLRRVEISSVSNKVSLYLGTRDGSTSSDGQECYTLHLVGPRLKTKPAYAEIIDYYRPHVRDIQMFTIPEDCPPRIAPDDSNRYIPSDNLFSKAKSLDGEDVVISYEYTIDDLPEGIPLEDPIYDNLTRINKVVDKKTNKDVRKNIFNDKPTNKTDAINTGNTEELKNQNGKIQLDYDKRSDDPKEDTLGDPQAGAHPHESESKQSLNPMLSSFHVIYSDQDLQVPSGIEGPVPAIVLPPPDFLARNRGSSWGGNVAVPVNPQYLRENYYRMNYLYDAMKRK-