Monarch geneset OGS2.0

DPOGS214994
TranscriptDPOGS214994-TA4923 bp
ProteinDPOGS214994-PA1640 aa
Genomic positionDPSCF300256 - 89897-102836
RNAseq coverage1215x (Rank: top 10%)
Annotation
HeliconiusHMEL0148312e-12838.24% 
BombyxBGIBMGA012159-TA2e-13839.24% 
Drosophilahts-PA4e-14053.89% 
EBI UniRef50UniRef50_E0VNI54e-14955.80%Putative uncharacterized protein n=1 Tax=Pediculus humanus corporis RepID=E0VNI5_PEDHC
NCBI RefSeqXP_314339.40.049.67%AGAP004852-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|1582930440.049.67%AGAP004852-PA [Anopheles gambiae str. PEST]
NCBI nr blastxgi|1571283547e-14555.19%adducin [Aedes aegypti]
Group
Gene OntologyGO:00468722.7e-43metal ion binding
KEGG pathwaybpt:Bpet26032e-08 
 K01628 (fucA)maps-> Fructose and mannose metabolism
InterPro domain[112-326] IPR0013032.7e-43Class II aldolase/adducin, N-terminal
Orthology groupMCL10530 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214994-TA
ATGGCGGATACGGATACAGAGACGCTTCCCAACGGAAACGCGACCCTTAGCGCCGAGGAGGAGGAGAGGCTGCGGCAGCGACCGGCTGACATCGATGCGGACGTCCGTGAGATGGAGCGCAGGAAGCGAGTGGAGGCGATCATGTCGTCCAAGATGTTCCGCGAGGAGCTGGAGCGAGTGCTGGACCAGCAGGCCAACGAGGGGAACGACGCCCCGCTGCTGCAGAGGATCCGCGAGATGGTGGGCGGCAGGCTGGGCGCGGGCAGCATGAGGGGTCCGAGTTGTATGTTGCCGATCAACGACATCCGCGGCATCGAGGGTGTGGGCTACGAGAAGGGGGAGAAGATACTCCGCTGTAAGCTGGCCTCCGTGTACCGGCTGGTGGACCTGTTCGGCTGGACCCAGAGTGGTATCTCCGGCCAGATCACCGCCCGCCTGAACACAGCCGTGGAGCAGGTGTTGACGACGCCGCGAGGTCTGCTGCCTCACGAGGTGACGGCCAGCTCTCTCGTGAAGCTGGACATGCAGGGCGCGGTTCAGGACCAGGGGACGACCAACTTCCCCGTCAACGTTGAGGGTTTCTCTCTGCACGCGTCGGTGCACGCCGCGCGGCCCGACCTCCGCTGTGTGCTGCACGTCCGGTCCGCAGCGGCCCTGGCCGTGTCCGCCAGCAAGCGACGCCTCATGCCGCTCTGCGGGGAAGCGGCGCTGGATCCTCTGCAGCGGATAGTGCGCGTGCCCGGAGGTGTTCTGGACAACGCCGAACGCGATAAGCTGGTCCGTGCCCTGGGACCTCACTCCAAGGTGTTGGTGCTGGCCGGTGGAGGAGCTCTCTGCTGCGGGGAGACCCTGGAAGAGGCCTTCTATCACGCCCGCATGTTGACAGCGGCCTGCGACGTCCAGCTTAGACTGGTCTCCATTCCCCAGGACGACCTGCTGCTGATAGACGAGGACACCAGGAGACAGATGTACGAGGCGTCCCGCAAGCCGCCGTCCGACTCCTCCAAGTGGCGCATCGGGGGCGAGGAGTTCGAGGCGCTGATGAGGATGATGGACAACGCCGGCTACAGGACCGGCCACGTCTACAGACACCCGCTCATCAAGAACGACGTGCCCAAGCCGAAGAGCGATGTAGAGGTGCCGCCAGCTGTGTCCTCGCTGGGGTACCTGCTGGAGGAAGAGGAGCTGTACAAGCAGGGATTGTGGAAGAAGGGCGGTAAATCCGGCGAGCGCACGCGCTGGTTGAACTCGCCGAATGTTTATCAGAAGGTGGAGCTCCTGGAGACGGGCACCTCGGACCCGAAACGAATCACCAAGGTAGAGTACCTCCGCCTGGAGGCCGATCGCTCCCCGGCTGATATACCTCCCAGTCAGTCCGCCTCGCCATCTCCCGGACACCCTGTATCCCCGCCAACACCCGCCGCCCATCCCCTACACCTTACAGCCCGAGCTCCCCAGCCACCGACACTGTCATCGTCCGAGCACAGCCCGAGCCATGTCCCTAGACCGGCTGCTCCTTGGGTGGGTGCTCGCGACGACGAGTGGGTCCAAGACGGGTCGCCGGCACACTCGACGCCGGTCAAGATCGACACTCTACAATTCGTGCCCAAAAATACCAATCCCAAGGAGTTCAAACAGCTTCAGCAGCAGATAAAAGAGAATCGCCGCGCGGACAAGATCAGCGCCGGGCCGCAGTCGCACATACTGGAGGGGGTGGCCAGGGACGAGGCGCGCCTGCTGCCCGAGGACGCGGCCGGCACGCACACCGGCGACCATGTGATCTTGATGGGAGCGGCGTCGAAGGGCATCATACAGCGAGGATATCAGCACAACGCGACCATGTACAGCGCGCCCTACGCCAGGAACCCCTTCGACCATGTCACGGACACCGACATAGACGAGTACAAGCGGGTGGTGGAGAGGAAACAGCGCGGCCAGGAGTTACGCATCTTACTTGCGGCGGATTTGAAACAGAAGTCTTATGATCATACCGTCCTGTGCGCAGACGACACGGACATATCAGAGTCGGAGGCTCTCAGCGCGCCGCAGCGCTCCGCCCCCGACACCGAGGACGAGTCGCGAGAAGAACAACGCGTGCTGAGAATAGAGACGAAGCAGGTCCCCGTGATGAGCCAGCCGGAGGTTGTGCTGAGTGATGACCCGTCAGATTTCTTGAATGCCGAACGCGACCACGCCGATCGGACCAGAGCGTCCCGTGACCCGTACCTCCTGTCTGATGACGAGGTGTTCCTGCCGGCGGAACCAAAGAAAATTGTGATCGTCAGTAGGACCACGACTAGCCTTAGCACTGTCGCCAGAGGTGAGAGGGGGGATAGTGGTGGTTCAAGGGATATCAGAACGCACGAGGCCCTATCATCCCACAAGCGCTCCCTGCACAAATACTCCCGGGCTCCCGGCTTTGACTGCACCGTACAGAAATATAACACAATACCCAAATGTAAAACCAGCCTTCACCATAAACTAGCCATGTCCAATTACGTGTCGTGTAATACGGACAGCGTGGGCAGTCGGTCCAGTCTGATATCGATACCGTCGATCGATTTAGATCGATGTAAAATCGATTCCTCATCTCGGAATCGATCCACATCGAAATGCAGTAACTTGTATCGGACGAATTCACACAGCGGCTTCATTAATTCGTATATAAAAGACCCGGAACCGACTCATTACCATTTGCCCAAGTGTTACTCTTGTGGGAGCATTGCCTACTCTTATCCGAGGTATTCGGGTGACGCTGTGAGGTATTTCTCGTGTGATAACATAGCTAGTAGAGCTTCAAATGATAAAGATGACAGACCTTTAAAGGTCAAAGTTAAAGAAGACACAGTACCCGTGTTCAAAGTGACGAACCAGGAGTTCGTTGATAGTCAGGAATTTAACGAGGATAATTATTTAGGTGACATCGGACACAGCTTCTCGTCAGAGAAACGGACGGTTGTTGAGGAAATGACGAATGCCAGTAGAATGGAATCCAACTTAGACGAGTTTAAGAATGACGCGAAGGAAAAAGATAGAGATGATATATATCGAGCCGTGTACGTTAAAGTAGACGAGGAGGCTGGTTCGGCTGAAGGAGAATATCACAGTTTTATAGACGATATGGACTTTGAAGAGGCCGTGTACGACAGTCCGGTGAAAGCTCTCGTTGACAAAGACATTAGAGACTATTCAGTGCCGATAGACTTCTACTGTGAGGACTATAATAGAAACGAGACGAGGAAACAGTCGCCAATGAAGGTGATAGAACACGCGGTGCAGACTGTCTTAGAGCCGATCCTAGAGGAGTCTAAGAGCTCGTCTGACTCGTGCAGCGAAAGCAACAAGACTGTCATATATAACGAAATCGGCTTCAGAGAACGCGTTGAGGGAAATCCAAATGGAACGAATGACGGGAGAATTTCGGGAAACGAATTAGAAAGAGAAAATTTAGCGCAACCCGAAAGTTATGTTACAACTAATGAAAATGATGACACAAACAAAAAAGCGCGAAACTTTGAGGAAGAAAGCAAGGGAGAGAAGAGAACCAGCTCCCTGAGCGGCGGAAGTTCCATGACCGTGCCTAGCCTAGATTCAACGGTGGAGTTTGAAAAATACGAAGTAATAGCTGACGTGATCAGCGTGATGTTGAAAAGATTTGAAGTTCAGGTCCAAGAGGAGTACAACAAACAAACCAACGACCAAACTCGCGTTGACTTACGATCAGGTGACGATAACGACGAAGACGATAAAGATGCCTTCTCTATCACGGAAATGGTTCTGGATTATGAGAAGAATTTAAGCTACAATAACACAGAGCTGAGCGAGAGCAGCGAGACCGAGGCGAGTTACCCACAGAGAATCGTTGAAGCTGTGGTCTACTTCATATTCGACAGGGCCGTTTACTGTTGTGACAGAAAAAATAAGAAGAGAGGTACCACGAAGAGGGTTGTCACCGTGGTGGACTGCGAGGACATTATATATACGACTTCTGATAAGATACTCCTCTTAAACGATACTTCTACCACGGATGTACCGAATGAAGACAGTTCCGGGCTGAGAGCTGTGGATGAAGTCCCAGTGTGTGATGATGATGCTAAAATGGGTTTTTTAAGTGAGAGTAGCCAGGACCCTGACTGCAGCCGTGACAGTGTGCTTCATAACGAGGTGACACCGAATTCCACGTTCGTCACCGACAGTGCGGACGTGTCGTTCACGTACTACAGCCAGACGGACGTACATGCTGGGATTGTAGACGATTTATTGGAGAGGTCGTTAAGAACATACGAGATCCCGAAGAATTCGAGCGAAAGTTTTGTTGCTGATGGCGAAGTCATGAATACGGCGTTTGTCGATTCCGATTTCTACGACAGATCGTCCTCTCCCCTGAGGAAGGTGTTCGAGGTGTGTACGGAGTCCCCCATCAGGAGGACAGGGTCCAGCCCAAACGAGTCCGTCTGTGACGCGGACTCGCCGTTTGTGAAGAAAATTAACGTAATATCGATGTCCCAGACGGTGCACAGCGGCGGCATCAAGTACTGGCTGTCCTTCGACGACAATTTAATTATCGAGAAGCCTTCACCGAAGTCGACCAAAAAATTCTCTGACAACAAGACACCCAGCTTCCTGGTTGTGGATTACGATAGGAAGACTGACTTCGTTGAAAAATGTTCCAGCATATTGACGGACAGCAGGTGTTTAGACGAAGCCGACGCCAGCACCTCCAATTACAACACCTGCGAGAGTTCAAGGTTCGATTCCACCACCGAAAAGTATATCTACAGTTCCAGGTCCAGGAGCAATAAAATACTGCACTCCACCTGGCCACCGTACGACGACACGCTGTTCTATAGAATCATATCTAAGTTCAGGCTCACGGAGAGTTTCGATTTCAATCAGCTGCAAAAGAGATGCGGCAGCTTCTGA

Protein sequence:

>DPOGS214994-PA
MADTDTETLPNGNATLSAEEEERLRQRPADIDADVREMERRKRVEAIMSSKMFREELERVLDQQANEGNDAPLLQRIREMVGGRLGAGSMRGPSCMLPINDIRGIEGVGYEKGEKILRCKLASVYRLVDLFGWTQSGISGQITARLNTAVEQVLTTPRGLLPHEVTASSLVKLDMQGAVQDQGTTNFPVNVEGFSLHASVHAARPDLRCVLHVRSAAALAVSASKRRLMPLCGEAALDPLQRIVRVPGGVLDNAERDKLVRALGPHSKVLVLAGGGALCCGETLEEAFYHARMLTAACDVQLRLVSIPQDDLLLIDEDTRRQMYEASRKPPSDSSKWRIGGEEFEALMRMMDNAGYRTGHVYRHPLIKNDVPKPKSDVEVPPAVSSLGYLLEEEELYKQGLWKKGGKSGERTRWLNSPNVYQKVELLETGTSDPKRITKVEYLRLEADRSPADIPPSQSASPSPGHPVSPPTPAAHPLHLTARAPQPPTLSSSEHSPSHVPRPAAPWVGARDDEWVQDGSPAHSTPVKIDTLQFVPKNTNPKEFKQLQQQIKENRRADKISAGPQSHILEGVARDEARLLPEDAAGTHTGDHVILMGAASKGIIQRGYQHNATMYSAPYARNPFDHVTDTDIDEYKRVVERKQRGQELRILLAADLKQKSYDHTVLCADDTDISESEALSAPQRSAPDTEDESREEQRVLRIETKQVPVMSQPEVVLSDDPSDFLNAERDHADRTRASRDPYLLSDDEVFLPAEPKKIVIVSRTTTSLSTVARGERGDSGGSRDIRTHEALSSHKRSLHKYSRAPGFDCTVQKYNTIPKCKTSLHHKLAMSNYVSCNTDSVGSRSSLISIPSIDLDRCKIDSSSRNRSTSKCSNLYRTNSHSGFINSYIKDPEPTHYHLPKCYSCGSIAYSYPRYSGDAVRYFSCDNIASRASNDKDDRPLKVKVKEDTVPVFKVTNQEFVDSQEFNEDNYLGDIGHSFSSEKRTVVEEMTNASRMESNLDEFKNDAKEKDRDDIYRAVYVKVDEEAGSAEGEYHSFIDDMDFEEAVYDSPVKALVDKDIRDYSVPIDFYCEDYNRNETRKQSPMKVIEHAVQTVLEPILEESKSSSDSCSESNKTVIYNEIGFRERVEGNPNGTNDGRISGNELERENLAQPESYVTTNENDDTNKKARNFEEESKGEKRTSSLSGGSSMTVPSLDSTVEFEKYEVIADVISVMLKRFEVQVQEEYNKQTNDQTRVDLRSGDDNDEDDKDAFSITEMVLDYEKNLSYNNTELSESSETEASYPQRIVEAVVYFIFDRAVYCCDRKNKKRGTTKRVVTVVDCEDIIYTTSDKILLLNDTSTTDVPNEDSSGLRAVDEVPVCDDDAKMGFLSESSQDPDCSRDSVLHNEVTPNSTFVTDSADVSFTYYSQTDVHAGIVDDLLERSLRTYEIPKNSSESFVADGEVMNTAFVDSDFYDRSSSPLRKVFEVCTESPIRRTGSSPNESVCDADSPFVKKINVISMSQTVHSGGIKYWLSFDDNLIIEKPSPKSTKKFSDNKTPSFLVVDYDRKTDFVEKCSSILTDSRCLDEADASTSNYNTCESSRFDSTTEKYIYSSRSRSNKILHSTWPPYDDTLFYRIISKFRLTESFDFNQLQKRCGSF-