Monarch geneset OGS2.0

DPOGS215836
TranscriptDPOGS215836-TA4200 bp
ProteinDPOGS215836-PA1399 aa
Genomic positionDPSCF300073 + 435750-446705
RNAseq coverage795x (Rank: top 16%)
Annotation
HeliconiusHMEL0116470.068.72% 
BombyxBGIBMGA013565-TA0.058.65% 
DrosophilaTepIII-PA3e-13827.42% 
EBI UniRef50UniRef50_D6WNG76e-16628.86%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WNG7_TRICA
NCBI RefSeqXP_972838.11e-16628.86%PREDICTED: similar to tep3 [Tribolium castaneum]
NCBI nr blastpgi|910837952e-16528.86%PREDICTED: similar to tep3 [Tribolium castaneum]
NCBI nr blastxgi|3320312653e-16029.25%CD109 antigen [Acromyrmex echinatior]
Group
Gene OntologyGO:00056153.8e-24extracellular space
GO:00055769e-16extracellular region
GO:00048661.7e-14endopeptidase inhibitor activity
KEGG pathway 
InterPro domain[857-1148] IPR0089309.4e-32Terpenoid cylases/protein prenyltransferase alpha-alpha toroid
[912-1124] IPR0116263.8e-24A-macroglobulin complement component
[1208-1348] IPR0090489e-16Alpha-macroglobulin, receptor-binding
[84-166] IPR0028901.7e-14Alpha-2-macroglobulin, N-terminal
[647-736] IPR0015992.3e-09Alpha-2-macroglobulin
[394-523] IPR0116258.7e-09Alpha-2-macroglobulin, N-terminal 2
Orthology groupMCL10119 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215836-TA
ATGAGGTACAACGCAGCTGTTTTTACCGTCCTGGCAGTTTTAGTTGCTAAAAACAATATACAATGCGTATCAGTATTAGGACCGAAGGTCTTAAGGCCGTATGGGAATTACAAAGTATCTATCGCTGGAGGTGACAAAGCGCATAATCTATATGTGGCTATAGAGGGCAGGAAGACAACTGGGGAACAGTTCTCACAGGGGCGAGTAGTGCAAGTGGCACCTGCTTCTTCTAGACTTATAGAACTCGATACCGATAAAGGTGTATACCAACCAGGTGACACCATTAACTTCAGAGTAATCGCTTTGGACAAGTATCTGTTGCCTCTCTCTGGGACGGTGGATGTGAGTGTGTTGGATACCAAGGGCTCACCAGTGAGGCAATGGGCTTCCGTCAACCTCGATAAAGGATTGTTTTCTAACGAGCTTCTGTTAGCTGATGAACCCGCTTTAGGACAGTGGACTATACAAGCAGAGGTCAAGGGGCAGAAATATTCGAAACATCTGATGGTGGCAGATTATGTGCTACCTAAGTTCCAGATGCATATGAAAGTACCAAAAGAGGTTCTGTTTAGCGAAGGAAGATTTAATATTAATGTTACAGCCAGACATTTTAATGGCCTACCTGTAAAAGGTGAATTAACAATATCCGCATACGCTGTGTTCTTCTCGGGACTACTTCAACCGGTATTTTCATCTCCCGCCCGTAAAGTCATTGAGTTTAACGGCCAAGCGGAAGTTTTGTATGACCTTAAAACAGACTTAGATCTGGCTGAAGATGCAGCCAGACCGTTAGTAGTTGAAGCTGTGATAGAAGAAAAAAATACACTGATACGACAGAATATTACCACTAGAATACTTCTTTTGCGAAGACCCTATAGACTTCAAGTTACTGCTCCTGAGAGGTTTAAACCTAGATTACCTTATATTGTCCAGATACAATTAGTTAATTCTACTGGTGATACGTTACCTGTATCTGATGATGTAGTCGTTGAAAGACTTTGGGATGATGGTGCACCTGTTAACAAAACAACTATTAAACTTAACAAAGGTTTTGGAATTTACACCTACACTCCAGATGTTGCGCACACAAATTCTACTCTTAATTTAGTGATCAAATACAAGGAAGTATCAGAAAGAATAGTTAACGTCCAGAAGAGCTTGGAGACTGGTGATCAATACATGACTCTGGAACTGTTAACACGAAATATGTCTATCGGTGATGAGATGCGTGGGAGAGCCACCTCCACGGAACCTATGGATCTGGTGCATTATGCGGTCATCGGAAGAGGGGACATTCTTGTTGCTAAGACATTAGAATTAAGCCCCCCTCGTACCAGCGTGGATATCTCAGTACCGGTAACAAGTGGTATGTCTCCGGGCTGCTCGCTAATAGCTTGGAGCCCCCGATTAACAGGATCTATACTGGCTGCAGCTTTACTGGTTCCACAAAAAGACTTAATGCAACATAAGGTGTCAGTAACATCAGTATCGCCAGGAACATCACTACGTCCTAATGGCCTGGTGGAGTTTCGAGTGCTCGGTGAGGCGGGAGCTCAGGCTGGTCTACTTGGAGGAGATCAACACGCCATTACTAACGGACTCGCTGGAACCAATGGCCTGGGTAGCGGACTGGATTTACACACGATCGAACGAGAAGTTGAAAGCTTCATTGGCATAAAAAGATCATATTTCAAAAATGATGACGGAATTCCAATTTTGGGAATAGACTTAGGTGGACGTAACTCTACCGATGTGTTTAGTAATGCTGGAATGGTTCTTCTGACAGATGGTGTTGTAGTATCAAACAGTATGAAGGACGAAACAGAGAAACATGAGACAGGCACCCGCCCACCAACAGCAGGTCCTTACGCGTTCAGTAGAGTGCCAACGCCGCCATCGCCAAGACAATACTTGACTGAGACACTTTCACCACTTTCCACTTGGATGTTTACTAATATAACTATTGGTTCCGACGGCGTTGGTACACGACAGCGTTGGTCCCCAATAACTCCTGGTGAATGGTCGGTCGGAGCATTTGCGATTCATCCAACACTGGGTCTTGGTCTTGCGGCACCTCGCAAATTTAACACTGCCCTTCCTCTATCCCTCACAGCCGAACTTCCCGCAAGTCTTCAAAGAGGAGAAACAATAGCTGTGATTGTGACCTTAAAAAGTTCTCTTACAGTTGATACACCAGTAGAAGTCACATTCCACAACTCCGATCAGTACTACGAATTCGAACCTCTAGAAAATAATATTGACTCGACAAAAAAGATTGAATTGTTCCGTCGAGTAAGCGTAACCGTGCCAGCTCGCGGGTCCGTCAGTACGGCGTTCCTCGTGAGCGCTCGTCGCGTCGGTGACTCACCCATCATTGTGGAAGCCAACGGCAATGGAGTCTCCGCTTCACTCTTCCGCACCATTGACGTTCAGGACGGATACATTGAAGATGTCTGGTCTTGGGCAATATTAGACGGTCGTCGAGGCGTTGCTCGCGCTAATATCACTCTTGAACCAGCAGCCGGGACTAAGCTCGGAGCAGTTTCTTTGGAAGCTACTGGGGACTTATTGGCAAATGCATTTAGGGCCATTAAAGCGCCGCCTATATCAGCCGCTGACCCTAATTATGCGCTAAGACCATTGGCGAGAGCTTGCGTATTGTTGGACTATTTGCAAGCCACAGATCAAGACGATGAAATCACTATAGTAAAAGAGGCTCGATCACAAGCAGCTACCGGCTACCAACGACTTATGGCATTCAGACGACCAGACGGGTCGTTCGTTCAGGAAATTGGTGAAGAATCTGAACCAGATGTCTGGATGACAGCATTATCAGCTCGATGGCTAAGCCGTTCCTCGCGCTATGTTGAAGTGTCTCCTGAAGCTGCAACATCCGCGGCACGCTGGCTGGTGGCAGCTCAAAGAAGTGACGGTAGCTGGCAACCTTCGGCATCACCTGACGACCCGCTGGGTCGGGAAGCCTTGCCACTCACGGCCCAAGCTTTACTAGCACTATTAGAGACTAAGGCCAGCGACCCGTTGTACAAAAACGCTATGAATAAAGCTTTGGATTACCTAGCCGATAAAGTCTCTGAGTCACTCGAGGCACCGACACTGGCGTTAGTGGGAGCCGCTCTGGCCGCCGCAAGACATCCTCGTGCTGCGCTAGCTCTGAAAGCCCTGGAAACACATGCACACAGTGACAGAGGTACCAATCTCTACTGGCCTCGAAAATTATCAAAATCGGAGTTACGGAACCCCTGGCTGAAGGGTAATTCTCTTGAGGCTTCGACTGCAGCTTGGGGTCTACGCGCTATGTTGGCTTCCAGTCTGATAGATGAATCTGTACCTGTTGCGCGATACCTTATACAAGCACTAGGACCTAGAGACCACGACCCGGATGTGTTAGACGCTTTGGCCTTGTTTGCGCACATGATTAGAACGACGACCAAACTGAGGGTATCTGTAAATGTCACCGGTTTCGAGGAACCGCGCCAGTTCAACATCGACAGCGACAATTCACTGATCTTACAAACACAACTGGTACGCAATGCTCGTAATGCGAGTGCAGTGACCGAGGGTCGGGGTATGGCCGTGGTGGGTCTAGCGGCTCGTGGCAGTACTAACGTGACGGGTGCCTGGCCTCGTTACACGCTCGACCCACGCGTGGATCAGGTCTCTACCAGAGACCGACTTCAGCTGTCTGTATGCATCGGATTTGTTCCTGCTGGCAATGAAACAGAAAGCGGACTGGCTCTTCTAATTGTGCAATTACCGTCGGGATATTTGGCTGACATAAATACTATAACAGAGCTAACGTCGGCGCGTCATGTTGTGGGTGCTCGAGTGGTGCACGGTGGATCCCGCGTGGTATCATGGGTGCGACCCTCAGTACACGAGCGCTGCGCCACCCTCGGAGCTCCACGCGCTCTACCCGTCGCAAGACAGAGGCCTGGATATGTCACCATAGTGGATCTTTATGACTCTAGTCACCGAGCGCGTGTCTTTTACCAAGCTGTCCCAAGTACCGCGTGCGACATTTGTCGCTCGTGGCCCTCATGTGAGCGCGCTTGTGGTTCCGCAGCGGAACAGCGTGCTTCCCCCACCACCCCCGCCGCCACACGTAACCCCAACAGTGCATCTGTCCCGCTCGCACAAACTGTGCTCTGTCTCGCTTTGGCATTGTTAGTCAGTATATAA

Protein sequence:

>DPOGS215836-PA
MRYNAAVFTVLAVLVAKNNIQCVSVLGPKVLRPYGNYKVSIAGGDKAHNLYVAIEGRKTTGEQFSQGRVVQVAPASSRLIELDTDKGVYQPGDTINFRVIALDKYLLPLSGTVDVSVLDTKGSPVRQWASVNLDKGLFSNELLLADEPALGQWTIQAEVKGQKYSKHLMVADYVLPKFQMHMKVPKEVLFSEGRFNINVTARHFNGLPVKGELTISAYAVFFSGLLQPVFSSPARKVIEFNGQAEVLYDLKTDLDLAEDAARPLVVEAVIEEKNTLIRQNITTRILLLRRPYRLQVTAPERFKPRLPYIVQIQLVNSTGDTLPVSDDVVVERLWDDGAPVNKTTIKLNKGFGIYTYTPDVAHTNSTLNLVIKYKEVSERIVNVQKSLETGDQYMTLELLTRNMSIGDEMRGRATSTEPMDLVHYAVIGRGDILVAKTLELSPPRTSVDISVPVTSGMSPGCSLIAWSPRLTGSILAAALLVPQKDLMQHKVSVTSVSPGTSLRPNGLVEFRVLGEAGAQAGLLGGDQHAITNGLAGTNGLGSGLDLHTIEREVESFIGIKRSYFKNDDGIPILGIDLGGRNSTDVFSNAGMVLLTDGVVVSNSMKDETEKHETGTRPPTAGPYAFSRVPTPPSPRQYLTETLSPLSTWMFTNITIGSDGVGTRQRWSPITPGEWSVGAFAIHPTLGLGLAAPRKFNTALPLSLTAELPASLQRGETIAVIVTLKSSLTVDTPVEVTFHNSDQYYEFEPLENNIDSTKKIELFRRVSVTVPARGSVSTAFLVSARRVGDSPIIVEANGNGVSASLFRTIDVQDGYIEDVWSWAILDGRRGVARANITLEPAAGTKLGAVSLEATGDLLANAFRAIKAPPISAADPNYALRPLARACVLLDYLQATDQDDEITIVKEARSQAATGYQRLMAFRRPDGSFVQEIGEESEPDVWMTALSARWLSRSSRYVEVSPEAATSAARWLVAAQRSDGSWQPSASPDDPLGREALPLTAQALLALLETKASDPLYKNAMNKALDYLADKVSESLEAPTLALVGAALAAARHPRAALALKALETHAHSDRGTNLYWPRKLSKSELRNPWLKGNSLEASTAAWGLRAMLASSLIDESVPVARYLIQALGPRDHDPDVLDALALFAHMIRTTTKLRVSVNVTGFEEPRQFNIDSDNSLILQTQLVRNARNASAVTEGRGMAVVGLAARGSTNVTGAWPRYTLDPRVDQVSTRDRLQLSVCIGFVPAGNETESGLALLIVQLPSGYLADINTITELTSARHVVGARVVHGGSRVVSWVRPSVHERCATLGAPRALPVARQRPGYVTIVDLYDSSHRARVFYQAVPSTACDICRSWPSCERACGSAAEQRASPTTPAATRNPNSASVPLAQTVLCLALALLVSI-