Monarch geneset OGS2.0

DPOGS208848
TranscriptDPOGS208848-TA2523 bp
ProteinDPOGS208848-PA840 aa
Genomic positionDPSCF300528 - 36936-59037
RNAseq coverage33x (Rank: top 75%)
Annotation
HeliconiusHMEL0115880.066.32% 
BombyxBGIBMGA011390-TA3e-15744.98% 
Drosophilahoe1-PC4e-14842.31% 
EBI UniRef50UniRef50_Q8IGX67e-14642.31%RE09889p n=21 Tax=Arthropoda RepID=Q8IGX6_DROME
NCBI RefSeqXP_320080.32e-14943.14%AGAP009284-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|3454924844e-14841.68%PREDICTED: P protein-like [Nasonia vitripennis]
NCBI nr blastxgi|1583000871e-14643.92%AGAP009284-PA [Anopheles gambiae str. PEST]
Group
Gene OntologyGO:00550857.8e-69transmembrane transport
GO:00157467.8e-69citrate transport
GO:00160217.8e-69integral to membrane
GO:00151377.8e-69citrate transmembrane transporter activity
KEGG pathway 
InterPro domain[294-654] IPR0046807.8e-69Divalent ion symporter
Orthology groupMCL10326 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208848-TA
ATGAATAAAGATAAGAGCAACAGTGTTTATTCTATGGTGTCATTCAGTGACCTGACGCCAGGAAGTTTGGATGTGTGGGTGGACCTGCCGGACGCCATAAAATATGATCCGACCCTGGCTCCCTTCAAGCAGATGTACGAACAGAAACATGGCAAGGATCTGTCTAACGTCGAGGTCGAAGTACCAGCAGGTGATGACATAAACAAAAATAACAAAATAGCAGAAGAAAATTTAGTTTGCGAAAAGCAAATACGTGATAAAGATGTTGAGAACAGATTGGTGGAAGATATTAGTCCTGATGACGAAAAAACATTACCGAAACGTAAGAAACAGCTGGACCGCACGCTACGAGTAATAAAACTGTCAGTACTGGTGGCCGGCTGGGTGATGCTAACTGTTTCTTTGCTGATGAACAGAGAGAAGACCGACATTATTCTCCATACAGCTGTAAATGCTGGAGAAATCAAAGAATATTTTTTGGGGTCGTCGAGTGAGGAGTTTAGGGTCGCTATTTCTTTGACCGGCCCCTTCACTGACTCTTCCACCAACGCGACCTCGTCACTCCAGCTCTGGCTGCACAGAACATCCAAATACAAAGAAGATGAACAGGATTCGCCAGCGTGGAGTATTAATCTTCAGCCAGATGACGTCATAGACTTCTCTCCCAGCGCATCCGAGGATAAGGTCCTCATGATAGATAAGCAAACCTTCATAAACAATGAGACGTACGAAGAAAAAAATGCGTCCGAACACGGAGTGAACGAATCCAGAATCTTTCTATGTCTCAATAGCAGCAGTAGTCAAGCCGTTCCTCTTACCATCAGTCTTCACGGAAAACCACTGTCTGAAACCGAAGGACTTATATACGCGAGCGTCCTCTTAGCCACGCTGTATATTCTTATAATATTTGAGATAGTTAACCGAACGCTAGCAGCCCTATTGTCGTCTTCCTTGGGTGTGGCAACCCTAGCGCTGGTCGGGGAACGTCCTTCCCTCCCCGAGCTGATCTCGTGGCTGGATGTGGAAACACTCTTGCTGCTCTTCAGCATGATGATACTCGTCGCCATAATCGCGGAAACCGGATTGTTTGATTTCCTCGCTGTTAAGGCTTTCGAGATAACGGCGGGCAGGACCTGGCCTTTGATTAACTGCCTCTGTTTCTTTACCGCATTCTTTTCAACGTTCCTCGACAACGTAACCACAGTCCTTCTGATGACGCCAGTCACTATACGGTTATGCGAGGTGATGCAGCTGAATCCGGTTCCAGTTCTGATGTCCATGGTCATTTTTAGCAATGTAGGCGGCGCGGCCACGCCTGTTGGAGATCCTCCAAATGTGATCATAGCCAGTCACCCCTCCATACTCGCTGTGAACATAAACTTCACGTCTTTCACCCTCCACATGGGTCTGGGTATACTCCTGGTGTGCATACAGACATACGTACAGCTGAGGTTCATGTTCAGGGACATGAACAGTCTAAGACACTGCGTGCCACGCGATATACTTGAATTGCGTCAAGAAATCAGCGTGTGGAAGCGCGCGGCCGCGTCATTATCATCTTACTCGAGAGACGAAGACATCGTCAGACGAGCGCTGGAGAAGAAGCTCACCTGGTCGCAGGTGCAGAGACTGAAGTCGACTCTCGGAAGAAGGGAGGCTGGCGGAGGCAATGACAAACTCTTCTGTTCAACTCTCGCTCATATGAAGGATAAGTATCGAATAAGGGACAAGGCGTTGCTAGTGAAGAGTGGTGTGTGTATTAGCTTCGTCGTTCTGGTCTTCTTCCTCCACGCTGTGCCTGAGCTACAGAGTTTGTCTCTGGGCTGGACGGCCTTGCTGGGAGCCCTGCTACTTCTGCTGCTGTCTGAGCGCGAAGACCTGGAACCTGTGCTGGCTAGAGTTGAATGGTCCACACTGCTGTTCTTTGCAGCTCTATTTGTGATGATGGAGGTAAATGAAAATAGCGGCGAGAGTTTGTCTCTGGGCTGGACGGCCTTGCTGGGAGCCCTGCTACTTCTGCTGCTGTCTGAGCGCGAAGACCTGGAACCTGTGCTGGCTAGAGTTGAATGGTCCACACTGCTGTTCTTTGCAGCTCTATTTGTGATGATGGAGGTGTTATCGAAGTTAGGTCTCATAGCGTGGATAGGGAGGATGACTGAAACTGTGATATCCCAAGTCGGCGAGGACTCTAGACTGGCTGTGGCTGTCATGCTGATACTTTGGGGAAACCAGAGTGATGTGCTAGAGTTACTCACAGCTAGTGATGTATCGGACAGAAATATTGTTACCGCGATGGTGGAAAGGAGTGAAAAGCTAAAGGCCATGTTCTCTCCAATCAAGGAACGCGCAGAGCGTCTGGTCATGATGCAGTCGCTGGCTGGGCTCGCAGCGATACTGTCGAGGTTCTCCGTCCAGCCTGCGCCGGGAGCGCCTCGAAAACCAACCATCGATACTCGCTCCAACATTGTCCAAGTAATACGAGGAGGTTTACCGCTCATTTTCACAGAGAGACGACCACACTGA

Protein sequence:

>DPOGS208848-PA
MNKDKSNSVYSMVSFSDLTPGSLDVWVDLPDAIKYDPTLAPFKQMYEQKHGKDLSNVEVEVPAGDDINKNNKIAEENLVCEKQIRDKDVENRLVEDISPDDEKTLPKRKKQLDRTLRVIKLSVLVAGWVMLTVSLLMNREKTDIILHTAVNAGEIKEYFLGSSSEEFRVAISLTGPFTDSSTNATSSLQLWLHRTSKYKEDEQDSPAWSINLQPDDVIDFSPSASEDKVLMIDKQTFINNETYEEKNASEHGVNESRIFLCLNSSSSQAVPLTISLHGKPLSETEGLIYASVLLATLYILIIFEIVNRTLAALLSSSLGVATLALVGERPSLPELISWLDVETLLLLFSMMILVAIIAETGLFDFLAVKAFEITAGRTWPLINCLCFFTAFFSTFLDNVTTVLLMTPVTIRLCEVMQLNPVPVLMSMVIFSNVGGAATPVGDPPNVIIASHPSILAVNINFTSFTLHMGLGILLVCIQTYVQLRFMFRDMNSLRHCVPRDILELRQEISVWKRAAASLSSYSRDEDIVRRALEKKLTWSQVQRLKSTLGRREAGGGNDKLFCSTLAHMKDKYRIRDKALLVKSGVCISFVVLVFFLHAVPELQSLSLGWTALLGALLLLLLSEREDLEPVLARVEWSTLLFFAALFVMMEVNENSGESLSLGWTALLGALLLLLLSEREDLEPVLARVEWSTLLFFAALFVMMEVLSKLGLIAWIGRMTETVISQVGEDSRLAVAVMLILWGNQSDVLELLTASDVSDRNIVTAMVERSEKLKAMFSPIKERAERLVMMQSLAGLAAILSRFSVQPAPGAPRKPTIDTRSNIVQVIRGGLPLIFTERRPH-