Monarch geneset OGS2.0

DPOGS206648
TranscriptDPOGS206648-TA4374 bp
ProteinDPOGS206648-PA1457 aa
Genomic positionDPSCF300048 - 314943-324679
RNAseq coverage26x (Rank: top 77%)
Annotation
HeliconiusHMEL0111340.050.33% 
Bombyx% 
DrosophilaAlkB-PA2e-6345.35% 
EBI UniRef50UniRef50_D2A2Y91e-6756.28%Putative uncharacterized protein GLEAN_07602 n=1 Tax=Tribolium castaneum RepID=D2A2Y9_TRICA
NCBI RefSeqXP_392512.22e-6948.88%PREDICTED: similar to AlkB CG33250-PA [Apis mellifera]
NCBI nr blastpgi|3407189781e-7049.06%PREDICTED: alkylated DNA repair protein alkB homolog 1-like [Bombus terrestris]
NCBI nr blastxgi|3323756487e-7252.06%unknown [Dendroctonus ponderosae]
Group
Gene OntologyGO:00041291.1e-12cytochrome-c oxidase activity
GO:00057391.1e-12mitochondrion
KEGG pathway 
InterPro domain[432-654] IPR0194424.7e-22Domain of unknown function DUF2428, death-receptor-like
[2-53] IPR0032131.1e-12Cytochrome c oxidase, subunit VIb
Orthology groupMCL16111 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206648-TA
ATGTCTTTCCCTGATAAACAACAACGAAAAGTTTGTTGGGATTCTAGAGATCGTTACTGGGCTTGTTTAGATGACCAAAATATTAAAGACAGTTCTGAAAAGCCAAAGGCATGTGCGGAATTTAGGAGACTATTTGAAAAATCTTGTCCACCTAAATGGGAGGTACTTATAGCAACGAAGAAAGCTAGCATTCAAAATGAAGAGTATGAAGAATTTAACATCATATTAGAAATGTTGTGGGACCATTTGAAAACTGGACTCGAAGAATTTGATGACCAATTACTTTGTTCTACATGTTTTTATAATGTCCTAAACAAGTCAACTAAACAAGAGTTCTATCTTAACCAGATCTTAATAATAATGAATGACGAACTAGAAATCAATGAGAAAAATCATGATAACATAAGTTTGGCTACAACTCTTTGTTATGGCCTCTATCAGTCCTCATATCTTGCTAATGATTTGAAAAATACAGCGGTACTTGAAAATACTATGAAATCAATTTTCAAATTGCTAGTAAGAATGTCATATGAGTATACACAGTATACATTTATAGTATTTAAGACTATGAGCTCATTTAAAAAAGTGATAGGAACATCATTAGAAATGACAATTTTCAGTAATGAAAATAAAATTAAAATGCTGCATGTAATAAACAATAATTGGGAGAATCCTATAACAGGAATACGAAACTTAAATAGGATTTTATTCCAAACATTACTTCAAGTTATCAATGACGATGAGTTGACACAAATAATAAGAGAAATTAATAAATTTTATTGGAACAAAGCAAAATATTTGATGCTGACCGAAGTGATTCTTAAGTACAGTGGGAATATTGTGGACTATGTGTTGAAAAATATGTGGCTTGGAGGACTATTGAATAGTTTATACAAGCCTGGCTTGGTATCCGCTGGCGCTGATATGTATTATGCTATTTTGGAGAAGCTTAATTCAGAAAATTGTTGGACAACATTGTTCCTGGAACCTGTAATCGAACTTCTTACAGGAGCCTCTTTTCAAGCTATAGAGAATTTTAAGAATTACTGGTGTTTGAACACTTTCAAGAAGTTTCCTACATTAGCCAAAGATTTAGTTCAACATTTGAAACTAGCCAGCGATTCAGAATTAAAGCTATACAGCATTTTATGTGTTCTTAAACAAGCCAACAAACTTGCTTTAATTGATAGAAAGTGGACAACAGCATCCCATAACAACGTTTTAGAAATAATATCTAATCAGTTTGCTTGGGAAGAAGACACTTTGACAAGTTCGGATTTTTCGAAAATGAGTGATATGGTACAGACTCTTATAAATAATTCTGAATATGCTGAAGAGAATAATCAAGACCAGACAAAGATTTCAGGTTTTCATCAGATCGTTCTGAATTGTCTGTGGCTAAATGTTAAGGCAGCTTGTGACTTAGCATCTTTGCTCATACTGTATTGCAAAGAATATGTAGAAGTTTGTGAAAAATGTTTGCACATAATTATGCATGTTCTAGAAACATCAAGACATAAAGGTGCCATAGAAGCGGCTGGGGCTGCCTTAGGTCAAGGAATCAATTACTTAACATCTACAAAGCTGAGCCCAGAGGTTTCACAATTACCATTTATTCTATTAAAGAAAAAATTAAATGAACTTTTATATGAAACAGCGAAAATGTCCTCTGTAACAAGAAGGGGTGCCGGCTTGTCAATAATGGTGCATAGAATTGTTAGTAATGATAGAAAAAAAGGAAAACCCCTGTTTCATTATTTTATGGTCGAATTATTAACAACATGCAAAAGTTTGGATAAAAATTGTGAGGTCGAGGAAAGTGATAAAGATCTGCCTAAAGCCATTTATATACATTTCCTGACAAGAATTGTGACTGACAGTAGTTTGGCTTCAGATATTATGTATTATTCAGCTGATCTAGCTGAATTAGCTTTCGATAATCTTACCAACTCCAATTGGCAGATCAGGAATGCAGCTCTTCAGTTATATGGTGCATTAATTCCAAAATTGATAGGTCAAAAAAAAGCTTCTGGGATTGATGAAGAAACCGTTGCTACTGTTGCATGTGATGAGTTCCTAACTCATTCTCCAAAGCTCTGGAAGTATATTGTCAAAGAAATTAATAACAATCAATCTAATGACATCATTCAGACACATTCAAATTTAGTTCCTATATTAAATGTGCTGGCAAATATTGCAAGGCGATACAATATCTCATGTGATGTTAAAGAACAAAGCAGTTTAGAATTAGATTTGTTAAAGAATCTTACATTATTACTGGGAAGTCCTATTTATACAGTAAGGAGACTTGCAGCAAAAAGTATTACGAACCTTTTCTGTTTTGATGTTGTCTATGATTTCATAAAAACTAATAATAATGATGCAGAAAATCATATTCATGGTTGTATAATGTTACTGTCCGAATTACACAAAATCTACTCAACTGAGGAAACATTAAGAACTCAACTTAATGAACTTATAAATAATTTGAATAATAAATTATTACAAGGGAATTACTCTTTTATATGCAGATGGGGTATTGAAATGTTATTACAAACAGAAAACCCCAAAAATATCACCATAGAACTGGTGCAAGAAATATTTTTACATACAAATGCTAATCGATATAAACCAGGGGTGGCTCTTTGGGAGAAGATTAAAGTTCAAAAATATTTGCAAGAGGCACCATTGAATGTAATCCCTTGTATTTTAAAAATGATCCTTGCATGTAATGAATATGAGAATTATTTGGACATACTTATAAATAGATTAGAAACCAGTGATGGCAATGGTCAGGTTACATTGGAAGAAATTGCTTATGCCCTTCTAAGTTCTGATTTAAAACTGCAAAGTAGCCATTTGTGGAAAATACTTTATTTAATATCGCTTAAAACGGATTTAAACTGTGATATGACACAAATTATGTCCTTAAACACAAATTTATTTAAAATTTCCTTCCAAATGAGATACATGATACCATTTGCTACAAGGTTATGTGTAACTCGAGGAGATAATTATTTATCTGTTTTATCAAATATTATACAGTCTTTATGCAATCCTGAAACCAATGATATAGATTTGCGGTATATAGCAGCAATATCAAACAATGAATTATCACATAAATTCAAAAGCCACAATGAGAATGTTAAAATTAATGCAATAATATCAGCAATCATCCTTTTACAAGACGAAGACGAAGATGTCCGGAATGTTAGTGTTGCATTTTATAAAAATGTTGTCAGTGATAAAAATCCTAAACAACCGTTTGTATGTCTAAATGCAATATTGGATATTGGGTTTTTATCAATAATGTTAAATCAACCTAAAATAAGTATTCCAAAAATTTGCACAGATTTGTTAAGTTTTATTGATGGCATGAGTCAAACAAAAGGCGATGAGATCATTGCAATGGAATCGAAGGAAAGTGCTAGAATACCTAGGGATGTATTTATGGAAACATTCAAATATTTTAAATCAAACAAACCAAAACCGTCTCTAGATAGAGTGATAACTACAGATTCTGATAACAGATTACTGCTGTTTCATAGAAGAGATATTATTGAAGACGTAAGGTCAAAATATTTGGGTTTGCGATCTCTAAAGGAATGGAAAGCCTACTCGTTGAAGAGTAATCCCGGCTTAATATTAATAAGAAACCCTTTCACTAGTCTAGGCCAAAGGTTTTGGATAAGAAAATGCTTAGAAATCTATCCAAAGAAACCAAATAGAACAAATATTGATGTAGAAACACATATAGATGACTGGTGGTCGGAATGTCATTATAGTGAGAGAAATAATAGCAAGCTTCTTAAGAAACTGCGCTGGACAACATTAGGATACCATCATAACTGGGACACAAAAGTTTACACTGAAGAAAATAAAGGAGTATTTCCCAGTGAACTTTCAGAGTTAGCTGACATAGTAGCACATTATCTTGGCTATGAAGGATTTCGTGCAGAAGCAGCGATTGTTAATTACTATCACATGAGTTCCACACTTTCTGCTCACACTGATCATTCAGAAATCAACTTAGAAGCACCTCTATTTTCATTTAGTTTCGGACAGTCGTCGATATTCCTAATAGGTGGTCAAGATAAATTTATAAATCCAATTCCTATTTTACTGAACAGCGGTGACATAGTCATCATGTCAAAGGAAGCAAGGCTGTGCTATCACGCTGTCCCTAAAATATTACCTTCACCTACGACACCCTGGAATAATCAAGAAGATTCCGGAGGAATGGATTTTAGTGCTGTTAAATTTAAATTCATAGATTATGATAGTAAGAAATATCCAGAAACTCATGATCCCAGGGAGGTTAACTACGCTAAAGAGTATGCTTCAAACACCGTTTTACAACTTAAGGCCAGATAA

Protein sequence:

>DPOGS206648-PA
MSFPDKQQRKVCWDSRDRYWACLDDQNIKDSSEKPKACAEFRRLFEKSCPPKWEVLIATKKASIQNEEYEEFNIILEMLWDHLKTGLEEFDDQLLCSTCFYNVLNKSTKQEFYLNQILIIMNDELEINEKNHDNISLATTLCYGLYQSSYLANDLKNTAVLENTMKSIFKLLVRMSYEYTQYTFIVFKTMSSFKKVIGTSLEMTIFSNENKIKMLHVINNNWENPITGIRNLNRILFQTLLQVINDDELTQIIREINKFYWNKAKYLMLTEVILKYSGNIVDYVLKNMWLGGLLNSLYKPGLVSAGADMYYAILEKLNSENCWTTLFLEPVIELLTGASFQAIENFKNYWCLNTFKKFPTLAKDLVQHLKLASDSELKLYSILCVLKQANKLALIDRKWTTASHNNVLEIISNQFAWEEDTLTSSDFSKMSDMVQTLINNSEYAEENNQDQTKISGFHQIVLNCLWLNVKAACDLASLLILYCKEYVEVCEKCLHIIMHVLETSRHKGAIEAAGAALGQGINYLTSTKLSPEVSQLPFILLKKKLNELLYETAKMSSVTRRGAGLSIMVHRIVSNDRKKGKPLFHYFMVELLTTCKSLDKNCEVEESDKDLPKAIYIHFLTRIVTDSSLASDIMYYSADLAELAFDNLTNSNWQIRNAALQLYGALIPKLIGQKKASGIDEETVATVACDEFLTHSPKLWKYIVKEINNNQSNDIIQTHSNLVPILNVLANIARRYNISCDVKEQSSLELDLLKNLTLLLGSPIYTVRRLAAKSITNLFCFDVVYDFIKTNNNDAENHIHGCIMLLSELHKIYSTEETLRTQLNELINNLNNKLLQGNYSFICRWGIEMLLQTENPKNITIELVQEIFLHTNANRYKPGVALWEKIKVQKYLQEAPLNVIPCILKMILACNEYENYLDILINRLETSDGNGQVTLEEIAYALLSSDLKLQSSHLWKILYLISLKTDLNCDMTQIMSLNTNLFKISFQMRYMIPFATRLCVTRGDNYLSVLSNIIQSLCNPETNDIDLRYIAAISNNELSHKFKSHNENVKINAIISAIILLQDEDEDVRNVSVAFYKNVVSDKNPKQPFVCLNAILDIGFLSIMLNQPKISIPKICTDLLSFIDGMSQTKGDEIIAMESKESARIPRDVFMETFKYFKSNKPKPSLDRVITTDSDNRLLLFHRRDIIEDVRSKYLGLRSLKEWKAYSLKSNPGLILIRNPFTSLGQRFWIRKCLEIYPKKPNRTNIDVETHIDDWWSECHYSERNNSKLLKKLRWTTLGYHHNWDTKVYTEENKGVFPSELSELADIVAHYLGYEGFRAEAAIVNYYHMSSTLSAHTDHSEINLEAPLFSFSFGQSSIFLIGGQDKFINPIPILLNSGDIVIMSKEARLCYHAVPKILPSPTTPWNNQEDSGGMDFSAVKFKFIDYDSKKYPETHDPREVNYAKEYASNTVLQLKAR-