Monarch geneset OGS2.0

DPOGS212161
Transcript: DPOGS212161-TA | 5808 bp
Protein: DPOGS212161-PA | 1935 aa
Genomic position: DPSCF300038 + 606500-628278
RNAseq coverage: 587x (Rank: top 22%)
Annotation
Heliconius: HMEL012541 | 0.0 | 78.32%
Bombyx: BGIBMGA006609-TA | 0.0 | 61.22%
Drosophila: l(1)G0196-PK | 0.0 | 66.62%
EBI UniRef50: UniRef50_Q9VR59 | 0.0 | 63.64% | Inositol hexakisphosphate and diphosphoinositol-pentakisphosphate kinase n=16 Tax=Coelomata RepID=VIP1_DROME
NCBI RefSeq: XP_975055.2 | 0.0 | 60.95% | PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum]
NCBI nr blastp: gi|189239707 | 0.0 | 60.95% | PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum]
NCBI nr blastx: gi|189239707 | 0.0 | 55.40% | PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum]
Group
Gene Ontology: GO:0003993 | 5.8e-82 | acid phosphatase activity
KEGG pathway:
InterPro domain: [899-1334] IPR000560 | 5.8e-82 | Histidine phosphatase superfamily, clade-2
Orthology group: MCL11324 | Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212161-TA
ATGTCCTACACTGAGCTTGAACATGGCTATCAAGGGTTGCGACAAGCTACTCAGCCTCAGTTTTATGTCGGCGAAGACGGAAACCTTGTACCAATAAGATCCTCTATAGTATTGTGTTTGTGTGTGATGGAGTGGCAGTGGCTTAAGGACTGGTGGCGTTTAAAAAGATGGAGGAACAAAAAAAAGAAAGACCAAAGTGATGATGAGTTTTGCTACTGTGATGATTGTTTGCCTGACGGCGAGTTGGAGCTGTGTGATTCCCTGGACCCGCACTGCGATGGCGGGGGCAAGAGAGTGGTGGTGGGGGTTTGTGCCATGGCTAAGAAGTCACAGTCGAAGCCGATGAAGGAGATCCTCACCCGCCTCGACGAATTCGAGTTCATTAAGATGCTCGTCTTTCCAGAGGAGGTTATCCTTAAGAAGCCGGTCGAGGAATGGCCGATCTGTGACTGTTTAATCTCGTTCCACTCCAAGGGCTTCCCCCTGGACAAGGCCATACAGTACGAAAAATTAAGGAAGCCCTACGTAATTAATAATCTCCACATGCAATACGATATCCAGGATCGTCGGAGAGTTTATGCCATTTTAGAGAACGAAGGTATAGAGATACCTCGGTACGCTGTCCTTGATCGGGACTCGCCGGATCCTAAACATCATGAACTTGTTGAATCAGAAGACCACGTGGAGGTGAACGGCGTTGTATTCAATAAGCCGTTCGTCGAGAAGCCGGTGTCGGCTGAAGATCATAATATTTACATATACTACCCCACCTCGGCCGGCGGCGGCAGTCAGAGGCTGTTCAGGAAGATCGGTAGCAGGAGCAGTATATATTCACCTGAATCCAGAGTGAGGAAGACGGGTTCTTTCATATACGAAGACTTCATGCCAACAGATGGTACTGATGTTAAAGTGTATACTGTCGGCCCCGACTATGCCCACGCTGAGGCCAGAAAGAGTCCAGCGCTAGACGGGAAGGTCGAGCGAGACTCGGAGGGGAAGGAAATAAGATATCCTGTTATATTATCCAATCAAGAGAAACTTATATCAAGAAAAGTCTGCTTAGCCTTCAAGCAGACTGTGTGTGGATTTGATCTGTTGAGAGCGAACGGCAAATCGTTCGTGTGCGACGTGAACGGCTTCTCTTTCGTAAAGAACTCGAACAAGTACTACGACGACTGCGCCAAGATCCTCGGCAACATGATACTGCGCGAGCTGGCGCCCACGCTGCACATACCGTGGTCGGTGCCCTTCCAGCTGGACGACCCGCCCATTGTGCCCACGACCTTCGGCAAGATGATGGAGCTGAGGTGCGTGGTGGGCGTCATCAGGCACGGCGACCGCACGCCCAAGCAGAAGATGAAGGTGGAGGTGCGCCATCCGAGATTCTTCGAGATCTTCGAGAAGTACGAGGGCTTCAAGCGCGGTCACGTGAAACTGAAGAAGCCCAAACAACTACAGGAGATTCTGGATATAGCGCGCTCGCTACTAGCAGACATACACACACGGCACGCCGACCCGGAGATAGAGGAGAAACAGGGCAAGCTGGAACAACTCAAGAGCGTACTGGAGATCTGTTTATTCACTATTGTGCTGGAAATCGGTAGCAGGAGCAGTATATATTCACCTGAATCCAGAGTGAGGAAGACGGGTTCTTTCATATATGAAGACTTCATGCCAACAGATGGTACTGATGTTAAAGTGTATACTGTCGGCCCCGACTATGCCCACGCTGAGGCCAGAAAGAGTCCAGCGCTAGACGGGAAGGTCGAGCGAGACTCGGAGGGGAAGGAAATAAGATATCCTGTTATATTATCCAATCAAGAGAAACTTATATCAAGAAAAGTCTGCTTAGCGTTCAAGCAGACTGTGTGTGGATTTGATCTGTTGAGAGCGAACGGCAAATCGTTCGTGTGCGACGTGAACGGCTTCTCTTTCGTAAAGAACTCGAACAAGTACTACGACGACTGCGCCAAGATCCTCGGCAACATGATACTGCGCGA
GCTGGCGCCCACGCTGCACATACCGTGGTCGGTGCCCTTCCAGCTGGACGACCCGCCCATTGTGCCCACGACCTTCGGCAAGATGATGGAGCTGAGGTGCGTGGTGGGCGTCATCAGGCACGGCGACCGCACGCCCAAGCAGAAGATGAAGGTGGAGGTGCGCCATCCGAGATTCTTTGAGATCTTCGAAAAGTACGAGGGCTTCAAGCGCGGTCACGTGAAACTGAAGAAGCCCAAACAACTACAGGAGATTCTGGATATAGCCCGCTCGCTACTAGCAGACATACACACACGGCACGCCGACCCGGAGATAGAGGAGAAACAGGGCAAGCTGGAACAACTCAAGAGCGTACTGGAGATGTACGGTCATTTTTCGGGCATCAATCGTAAGGTCCAGATGAAGTACCAGCCGCGCGGTCGACCGCGGGGCTCCAGCTCCGACGACGGTAGTGACGGTGAGCCTTCTCTGGTGTTGATTCTGAAGTGGGGCGGCGAGCTGACTCCGGCCGGGCGGATCCAGGCCGAGGAGCTGGGGCGGATGTTCAGATGCATGTACCCTGGAGGTCAAGGCAGACATATTCCAGGTGAAGGAGGCACTCAGGGGCTGGGCCTGTTGAGATTACATTCAACGTTCAGACACGATCTGAAGATATACGCCTCGGACGAAGGTCGCGTGCAGATGACGGCGGCCGCGTACGGTCATTTTTCGGGCATCAATCGTAAGGTCCAGATGAAGTACCAGCCGCGCGGTCGACCGCGGGGCTCCAGCTCCGACGACGGTAACGCCCCCGGTAGTGACGGTGAGCCTTCTCTGGTGTTGATTCTGAAGTGGGGCGGCGAGCTGACTCCGGCCGGGCGGATCCAGGCCGAGGAGCTGGGGCGGATGTTCAGATGCATGTACCCTGGAGGTCAAGGCAGACATATTCCAGGTGAAGGAGGCACTCAGGGGCTGGGCCTGTTGAGATTACATTCAACGTTCAGACACGATCTGAAGATATACGCCTCGGACGAAGGTCGCGTGCAGATGACGGCGGCCGCGTTCGCTAAGGGCTTGCTGGCGCTGGAGGGGGAACTCACGCCCATTTTGGTGCAAATGGTTAAATCCGCTAACACCAACGGCCTGCTAGACAACGACTGCGATTCATCCAAAGTACAGAATATGGCGAAGGCCCGTCTGCACGAGGCGCTCCAGGCCGACCGTTCGTTCAGCGCGTGCGACCGCGCTCGCGTCAACCCTTGCGGCTCGCTGAGCATAGCGGCCGCGTTAGAGTTCGTCGACAACCCTGCCCGGACCTGCGCTCACGTGCACAGCCTCATCAACAGTTTGGTGAGGATCGTGCTGGCCAAGAAGGACGATCCCAAGACTAAAGATACAATTCTATACCACGGCGAGACCTGGGAGTTGATGGGGCGGCGGTGGGGGAAGATAGAAAAGGATTTCTGCACGAAGAATAAAACCTACGACATATCCAAGATACCGGACATCTACGACTGCATCAAGTACGATCTGCAGCACAACCAGCACACGCTGCAGTTCGACCTGGCCGAGGAGCTGTACATATACGCCAAGTACTTGGCCGATATTGTTATACCACAAGAATATGGTCTAACAGTCCATGAGAAATTGACGATCGGACAAGGAATATGTACACCGCTGCTGAAGAAGATTAGAGCGGACTTGCAGAGAAATATAGAGGAGTCCGGAGAGGAGAACGTCAATAGGCTCAACCCAAGGTACAGCCACGGTGTGTCGAGTCCGGGAAGGCACGTGAGAACCAGACTGTACTTCACTAGCGAGAGTCACGTGCACTCCCTGCTCACCGTGCTGCGCTTCGGCGGCCTGCTCGACGTACTGAAAGACGAACAATGGCGGCGGGCTATGGAATACGTGTCCATGGTGTCCGAGTTGAATTACATGTCACAGATCGTGGTGATGCTGTATGAAGATCCCACCAAGGATCCGTACTCGGAGGAGAGGTTCCACGTGGAGCTGCATTTCA
GTCCAGGTGTGAACTGCTGCGTCCAGAAGAACCTGCCGCCCGGCCCGGGGTTCAGACCGCACTCGCGGAACCATTCAACAGCCAACAACGATCAGAGCCCGTCGGACGAGGTGAAGTGTATAGAGGAGGAGACGGAAGACGACCAAATGGCCAGCCAGGAAACATTACAGCCGGACGCCTCCGACGAGTTTGATAACACATTCTCTCCCAGTAAAACGTCCAAGTCAAAGTTGAGGCCTTCGGATCCCATACCTATTTGCAACCTGCACTACACAGTGAGCGGTCACGAGGCGTCCTCCCTTGCGGCGCGACTCAGCGAGGAGCTCCGCGCGCGGCATCCGAAAGACGACGGACACGCGCTGCAGCACGCGAGCGAGGAGAGCGAGTCCAAAGCGTCAATCGAGCCGCAGCCGCGGGCCCGCAGCTACGACCAGAACAAACAGACCAACGACAATGCTCCCCCCACGCCCTCTCCTCCCGCGGTGGACGTGTCCGGTGTGGAGTCGTCCACTAACGAGCTGCCCCTGTCCGCGCTGACGTTCAGTCGCAAAAGCGCGGGCGGGGCGGACGGCTGTATCGTGCGGGGCGCAGGGCGGCGCCAGCGGCACAGCATCGCCGGCCAGATGAGCTACCTCAAGATGTTGGGTCTGGGGGCGAGGGTGAAGCTGCCGGGCGCCGCGGGACTGTTCAGCACCGCCGTCATCTCGGGGTCCAGCTCGGCGCCCAACCTGCGAGTCATGATACCCGCGTCCTCCGCATCCAACGCGGCGTTGGAGGGTTTCGGTGGCGTGCCGGCCATCCGGCCGCTGGAGACGCTGCACAACGCGTTGTCTTTGAGGCAGTTGGACGCCTTCCTGGAGAGAGTGACGGACACTGCGGCCAAGGCAGGGCCGCCGGCAGCCGCGCCCGCCAAACAACCGTCGCCCACTAATAGCGTCGGTTGGAGCGGTCCGTCGTCTTTGATGTCTTCGTCGGGAGAGTTACCGAGCGGCCTTTCCTCCGCCGAGCCCTCCTCCCCCAACAACAACTACGCAGACAACAAGACGGAGAGTAGCAATTGGAGTAAGACTGCAGCTGGGAAGTTACCCGACTGGGGCACGGACGCCGCGCTACGAGCGCTCGTCGTGGCTGCACCAGGTTCGTCTATTAGCGGCGACGTGACGCCGGTGTCAGTGGCGTCCGAGCTGGAGTTCAACAGTAACGAGGCCACCGCCGGCTCCATGACGGATCACGACCTGATGAGTGCTGAAGGCGAAGACGATGAAGAAACCTTATCAGCAGACGCGTGCGGTTCGTCTGTCAAATATGTTAGAAATGATCTGTCAGATGCTAATATGTCAAAAAATAGTACATTAGAGAACAGCGAACCTTCCTCTAGCGCAGCTTATGTAGGATCGTACGATGTCAATTATCAACATATTAAGGATACGTACAGTGATGTGGCATCAAGTAGTAAAATTTATAACGTAGGTGATGATGTAAGCAACACCTCGACTAATGATACGTTGCTATACAGTTCCGAAGTTTCAAATTTACAAAGCGAGGAATTTAAAAGTAAGACAAACAATGGCCTTAAAAATGTACGTAAAATGTATAATGATAGTAGTGGTAGCAAAAATAACAATCAGTCCGGATCAGGGAGGTTCATTACGACATTGGTTTCCGAAGAGTTATTGGAAGGGAATACATCGAGTGAACCCAAATATAGTACAATAGAGCCAGCGGAGCTGACGCGTCTCGTCGGACCGTTGGTCGTTAAGGCTAAAGTCGGTAACGTTACACCGGGTTTTACAATTGACGAAGATTAA

Protein sequence:

>DPOGS212161-PA
MSYTELEHGYQGLRQATQPQFYVGEDGNLVPIRSSIVLCLCVMEWQWLKDWWRLKRWRNKKKKDQSDDEFCYCDDCLPDGELELCDSLDPHCDGGGKRVVVGVCAMAKKSQSKPMKEILTRLDEFEFIKMLVFPEEVILKKPVEEWPICDCLISFHSKGFPLDKAIQYEKLRKPYVINNLHMQYDIQDRRRVYAILENEGIEIPRYAVLDRDSPDPKHHELVESEDHVEVNGVVFNKPFVEKPVSAEDHNIYIYYPTSAGGGSQRLFRKIGSRSSIYSPESRVRKTGSFIYEDFMPTDGTDVKVYTVGPDYAHAEARKSPALDGKVERDSEGKEIRYPVILSNQEKLISRKVCLAFKQTVCGFDLLRANGKSFVCDVNGFSFVKNSNKYYDDCAKILGNMILRELAPTLHIPWSVPFQLDDPPIVPTTFGKMMELRCVVGVIRHGDRTPKQKMKVEVRHPRFFEIFEKYEGFKRGHVKLKKPKQLQEILDIARSLLADIHTRHADPEIEEKQGKLEQLKSVLEICLFTIVLEIGSRSSIYSPESRVRKTGSFIYEDFMPTDGTDVKVYTVGPDYAHAEARKSPALDGKVERDSEGKEIRYPVILSNQEKLISRKVCLAFKQTVCGFDLLRANGKSFVCDVNGFSFVKNSNKYYDDCAKILGNMILRELAPTLHIPWSVPFQLDDPPIVPTTFGKMMELRCVVGVIRHGDRTPKQKMKVEVRHPRFFEIFEKYEGFKRGHVKLKKPKQLQEILDIARSLLADIHTRHADPEIEEKQGKLEQLKSVLEMYGHFSGINRKVQMKYQPRGRPRGSSSDDGSDGEPSLVLILKWGGELTPAGRIQAEELGRMFRCMYPGGQGRHIPGEGGTQGLGLLRLHSTFRHDLKIYASDEGRVQMTAAAYGHFSGINRKVQMKYQPRGRPRGSSSDDGNAPGSDGEPSLVLILKWGGELTPAGRIQAEELGRMFRCMYPGGQGRHIPGEGGTQGLGLLRLHSTFRHDLKIYASDEGRVQMTAAAFAKGLLALEGELTPILVQMVKSANTNGLLDNDCDSSKVQNMAKARLHEALQADRSFSACDRARVNPCGSLSIAAALEFVDNPARTCAHVHSLINSLVRIVLAKKDDPKTKDTILYHGETWELMGRRWGKIEKDFCTKNKTYDISKIPDIYDCIKYDLQHNQHTLQFDLAEELYIYAKYLADIVIPQEYGLTVHEKLTIGQGICTPLLKKIRADLQRNIEESGEENVNRLNPRYSHGVSSPGRHVRTRLYFTSESHVHSLLTVLRFGGLLDVLKDEQWRRAMEYVSMVSELNYMSQIVVMLYEDPTKDPYSEERFHVELHFSPGVNCCVQKNLPPGPGFRPHSRNHSTANNDQSPSDEVKCIEEETEDDQMASQETLQPDASDEFDNTFSPSKTSKSKLRPSDPIPICNLHYTVSGHEASSLAARLSEELRARHPKDDGHALQHASEESESKASIEPQPRARSYDQNKQTNDNAPPTPSPPAVDVSGVESSTNELPLSALTFSRKSAGGADGCIVRGAGRRQRHSIAGQMSYLKMLGLGARVKLPGAAGLFSTAVISGSSSAPNLRVMIPASSASNAALEGFGGVPAIRPLETLHNALSLRQLDAFLERVTDTAAKAGPPAAAPAKQPSPTNSVGWSGPSSLMSSSGELPSGLSSAEPSSPNNNYADNKTESSNWSKTAAGKLPDWGTDAALRALVVAAPGSSISGDVTPVSVASELEFNSNEATAGSMTDHDLMSAEGEDDEETLSADACGSSVKYVRNDLSDANMSKNSTLENSEPSSSAAYVGSYDVNYQHIKDTYSDVASSSKIYNVGDDVSNTSTNDTLLYSSEVSNLQSEEFKSKTNNGLKNVRKMYNDSSGSKNNNQSGSGRFITTLVSEELLEGNTSSEPKYSTIEPAELTRLVGPLVVKAKVGNVTPGFTIDED-
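
The transcript and protein records above can be cross-checked against each other. The following is a minimal sketch, assuming the standard genetic code and that the trailing "-" in the protein record marks the stop codon. The helper names `translate` and `lengths_consistent` are hypothetical and not part of the geneset tooling. It translates the first and last codons of DPOGS212161-TA and checks that 5808 bp corresponds to 1935 residues plus one stop codon.

```python
# Hypothetical helpers for a sanity check; assumes the standard genetic code.
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order ("*" marks a stop codon).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def translate(nt, stop_symbol="-"):
    """Translate an in-frame nucleotide string, writing stops as stop_symbol."""
    nt = nt.upper()
    if len(nt) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(nt), 3):
        aa = CODON_TABLE[nt[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


def lengths_consistent(transcript_bp, protein_aa):
    """True if the transcript is exactly the coding codons plus one stop codon."""
    return transcript_bp == 3 * (protein_aa + 1)


if __name__ == "__main__":
    # First 27 nt and last 15 nt of DPOGS212161-TA, copied from the record.
    print(translate("ATGTCCTACACTGAGCTTGAACATGGC"))
    print(translate("ATTGACGAAGATTAA"))
    print(lengths_consistent(5808, 1935))
```

Passing the full nucleotide sequence (with line breaks removed) to `translate` should reproduce the DPOGS212161-PA record if the transcript is a complete coding sequence with no untranslated regions, which the length check suggests but does not prove.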