Monarch geneset OGS2.0

DPOGS215190
Transcript         DPOGS215190-TA   1671 bp
Protein            DPOGS215190-PA   556 aa
Genomic position   DPSCF300143 - 283221-288439
RNAseq coverage    27x (Rank: top 77%)
Annotation
Heliconius       HMEL009264         0.0      70.27%
Bombyx           BGIBMGA008669-TA   3e-151   51.37%
Drosophila       Oatp33Eb-PA        4e-27    26.19%
EBI UniRef50     UniRef50_Q171D6    5e-27    23.92%   Organic anion transporter n=4 Tax=Culicidae RepID=Q171D6_AEDAE
NCBI RefSeq      XP_319188.4        4e-31    26.30%   AGAP010043-PA [Anopheles gambiae str. PEST]
NCBI nr blastp   gi|158299078       7e-30    26.30%   AGAP010043-PA [Anopheles gambiae str. PEST]
NCBI nr blastx   gi|158299078       3e-31    24.12%   AGAP010043-PA [Anopheles gambiae str. PEST]
Group
Gene Ontology     GO:0016020   3.1e-18   membrane
                  GO:0006810   3.1e-18   transport
                  GO:0005215   3.1e-18   transporter activity
KEGG pathway
InterPro domain   [199-513]   IPR004156   3.1e-18   Organic anion transporter polypeptide OATP
Orthology group   MCL19893   Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215190-TA
ATGATGGTCATATCTAAAGGTTGGTACCTTCTCCGGAGGTATATTCTAACTATCCCTCGATTCGACCTGTTCCTGCAGGGAGCTCTACTCATTGTCGTGTTCCTGGAGAGCTACTCATATCTTCTAATAAGAAGAAATGCCGGCACGGGTTACTTGTCATCAATCAATGAAGATTGGGTGAAGATCGGCGTGGCTGGAGCTGAGTTCTTCCTCGGTTCTGTGGTAGCTTGGTCCGGGAGGGGTGTGAGACACTTCGCATTATCTGGATGGTTGGGCCTGACAGCGGTGTCGGGTCTGATAGTTCTGGCCTTCCCATACCCTGACTCCGGACGACGCTCTGTCCAACTGTGTGGAGGCGGTTCAATATCCGGGTACAATGATGTTGAGGCAGTACCAATCGACCATTACCTTCAAGCAAGGACCGGATTCCTTGTGCTCACGGCTGTTCTGTGCGCGCTCACGAAAATAAGCGTCTGGGCACACGGCATCACATATCTAGATGACCACGAACCTGAGAGTGGAACATACTTTTATGGTATCCTAATCTCTATCAGATTGTCTCTGGGTCTAAGCGCTCAGAACTGGTTGCAGCACGTGTCAGTGCGTGATGACTGGTGGGAGGCTCAGGTTTCCCTGGCCATGCTCACTCTGATGTTCTCAATTCTTTTCACGTTGTTCCCTCATAAAATGGAAGGATATAAAGATTTCGAAGAGTTGGAATATAACTGTATTTTAGCTCCAATCGGTCGTATGCTACGCAACAAAGCGTTGATGCTCCAAGTAGCGGCTCTCTCAATCCTCAACACCGCTGTATTTGGTTACGTAAACTTTGATACTGCTTCCATACAGGCTAAGTTCTTCGTGGAAACTCTACGCCAAGATCCTCGAACAGTGCGGACAATCATGGACATCTTTAGATCGCTTGTTATCATTTTCTTCGTGTCTATATTCCGCATGCGTTTCTCGGGCCGCCGTAGCGACGGTGTGAAATCTAACACAGCGTCACGTGTAGGCGGGGCTGTATGTGTTCTGGTGGCGGCCTTCTTTGCGGTGCTGGCTGGCCTGCACTGTAATACCGGAGAGCTAGCAGGGTTCGGAGGTCTCACGGAGGAATACGAGCAGCCCTCGTGTAGCGCGCAGTGCGGGTGCGGCTCCGAGAAGTACGGCTTCAGTCCGGTCTGTATTCTGAACACCTCAACAACATATTTTTCTCCGTGCCACGCTGGCTGCCGCGAGTATGAGGACTTAGGGGGATTCTTACTGTTCAGTGAGTGTGCGTGTGGCTCTGGACGTGCGGTTAGAGGTTCCTGTAACCTGGCTTCCTGCTGGCTGCCCTATTCCCTGTACCTCGTTTTCTTCACACTCATGCTTGCTAGCAGTGCTGCGTCGTTCCTCATGCAAGGGATGGCTATATTAAGAGCTGTTCCCCGCCGGGATAAACCCATCGCTATCGGGGTCGCCTTCTCCATCGTGGGTCTGACCGCCCACGGCCTCGGCCACTTGCTCTATATGGTTATAGGATATTTAACCTGCGGTTATAGCGACGGCGAGACCTGTCTGCTGCACGATTACAGCATTTGGATTGTGGGTGCCGCAAGTGCCGTATTAGCTGTGCTGTCCGGGGCAATAAGTATCTTAGCTAGCAGGTGTTCCAATTCCAATTCCGGCTGA

Protein sequence:

>DPOGS215190-PA
MMVISKGWYLLRRYILTIPRFDLFLQGALLIVVFLESYSYLLIRRNAGTGYLSSINEDWVKIGVAGAEFFLGSVVAWSGRGVRHFALSGWLGLTAVSGLIVLAFPYPDSGRRSVQLCGGGSISGYNDVEAVPIDHYLQARTGFLVLTAVLCALTKISVWAHGITYLDDHEPESGTYFYGILISIRLSLGLSAQNWLQHVSVRDDWWEAQVSLAMLTLMFSILFTLFPHKMEGYKDFEELEYNCILAPIGRMLRNKALMLQVAALSILNTAVFGYVNFDTASIQAKFFVETLRQDPRTVRTIMDIFRSLVIIFFVSIFRMRFSGRRSDGVKSNTASRVGGAVCVLVAAFFAVLAGLHCNTGELAGFGGLTEEYEQPSCSAQCGCGSEKYGFSPVCILNTSTTYFSPCHAGCREYEDLGGFLLFSECACGSGRAVRGSCNLASCWLPYSLYLVFFTLMLASSAASFLMQGMAILRAVPRRDKPIAIGVAFSIVGLTAHGLGHLLYMVIGYLTCGYSDGETCLLHDYSIWIVGAASAVLAVLSGAISILASRCSNSNSG-
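
The transcript and protein lengths listed above (1671 bp and 556 aa) are consistent with a coding sequence of 557 codons whose final codon is a stop, which this record appears to show as the trailing "-" in the protein sequence. The following is a minimal Python sketch of that check, assuming the standard genetic code and that the transcript is the complete coding sequence in frame 0. The names `translate` and `expected_protein_length` are hypothetical helpers written for illustration; they are not part of any database tool.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence in frame 0 using the standard code.

    Stop codons are written as stop_symbol (assumed here to be "-",
    matching the last character of the protein sequence in this record).
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


def expected_protein_length(cds_length_bp):
    """Number of residues for a CDS that ends in a single stop codon."""
    return cds_length_bp // 3 - 1


# First eight codons and last three codons of DPOGS215190-TA.
print(translate("ATGATGGTCATATCTAAAGGTTGG"))
print(translate("TCCGGCTGA"))
print(expected_protein_length(1671))
```

Passing the full nucleotide sequence to `translate` and comparing the result with the listed protein sequence would extend the same check to the whole record.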