Monarch geneset OGS2.0

DPOGS215192
TranscriptDPOGS215192-TA1470 bp
ProteinDPOGS215192-PA489 aa
Genomic positionDPSCF300143 - 270529-274089
RNAseq coverage0x (Rank: top 99%)
Annotation
HeliconiusHMEL0092641e-17668.71% 
BombyxBGIBMGA008669-TA8e-11750.80% 
DrosophilaOatp33Eb-PA4e-2827.18% 
EBI UniRef50UniRef50_Q9VK826e-2627.18%Organic anion transporting polypeptide 33Eb n=11 Tax=Drosophila RepID=Q9VK82_DROME
NCBI RefSeqXP_319188.44e-3026.05%AGAP010043-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|1582990788e-2926.05%AGAP010043-PA [Anopheles gambiae str. PEST]
NCBI nr blastxgi|1582990782e-2924.24%AGAP010043-PA [Anopheles gambiae str. PEST]
Group
Gene OntologyGO:00160201.7e-18membrane
GO:00068101.7e-18transport
GO:00052151.7e-18transporter activity
KEGG pathway 
InterPro domain[132-446] IPR0041561.7e-18Organic anion transporter polypeptide OATP
Orthology groupMCL19893 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215192-TA
ATGTTTAAACTTAACAACGAAGATAACAAAAACAAAACAAATCATAGCACAGCAACTAGCGGGCATCACACGGACACACACGGCAGCGATGCTTTCGAATGCGAGCAAACGAATGAGCATGGCACGATGGCGGTAATGCCTGAACTGTGTGGAGGCGGTTCAATATCCGGGTACAATGATGTTGAGGCAGTACCAATCGACCATTACCTTCAAGCAAGGACCGGATTCCTTGTGCTCACGGCTGTTCTGTGCGCGCTCACGAAAATAAGCGTCTGGGCACACGGCATCACATATCTAGATGACCACGAACCTGAGAGTGGAACATACTTTTATGGTATCCTAATCTCTATCAGATTGTCTCTGGGTCTAAGCGCTCAGAACTGGTTGCAGCACGTGTCAGTGCGTGATGACTGGTGGGAGGCTCAGGTTTCCCTGGCCATGCTCACTCTGATGTTCTCAATTCTTTTCACGTTGTTCCCTCGTAAAATGGAAGGATATAAAGATTTCGAAGAGTTGGAATATAACTGTATTTTAGCTCCAATCGGTCGTATGCTACGCAACAAAGCGTTGATGCTCCAAGTAGCGGCTCTCTCAATCCTCAACACCGCTGTATTTGGTTACGTAAACTTTGATACTGCTTCCATACAGGCTAAGTTCTTCGTGGAAACTCTACGCCAAGATCCTCGAACAGTGCGGACAATCATGGACATCTTTAGATCGCTTGTTATCATTTTCTTCGTGTCTATATTCCGCATGCGTTTCTCGGGCCGCCGTAGCGACGGTGTGAAATCTAACACAGCGTCACGTGTAGGCGGGGCTGTATGTGTTCTGGTGGCGGCCTTCTTTGCGGTGCTGGCTGGCCTGCACTGTAATACCGGAGAGCTAGCAGGGTTCGGAGGTCTCACGGAGGAATACGAGCAGCCCTCGTGTAGCGCGCAGTGCGGGTGCGGCTCCGAGAAGTACGGCTTCAGTCCGGTCTGTATTCTGAACACCTCAACAACATATTTTTCTCCGTGCCACGCTGGCTGCCGCGAGTATGAGGACTTAGGGGGATTCTTACTGTTCAGTGAGTGTGCGTGTGGCTCTGGACGTGCGGTTAGAGGTTCCTGTAACCTGGCTTCCTGCTGGCTGCCCTATTCCCTGTACCTCGTTTTCTTCACACTCATGCTTGCTAGCAGTGCTGCGTCGTTCCTCATGCAAGGGATGGCTATATTAAGAGCTGTTCCCCGCCGGGATAAACCCATCGCTATCGGGGTCGCCTTCTCCATCGTGGGTCTGACCGCCCACGGCCTCGGCCACTTGCTCTATATGGTTATAGGATATTTAACCTGCGGTTATAGCGACGGCGAGACCTGTCTGCTGCACGATTACAGCATTTGGATTGTGGGTGCCGCAAGTGCCGTATTAGCTGTGCTGTCCGGGGCAATAAGTATCTTAGCTAGCAGGTGTTCCAATTCCAATTCCGGCTGA

Protein sequence:

>DPOGS215192-PA
MFKLNNEDNKNKTNHSTATSGHHTDTHGSDAFECEQTNEHGTMAVMPELCGGGSISGYNDVEAVPIDHYLQARTGFLVLTAVLCALTKISVWAHGITYLDDHEPESGTYFYGILISIRLSLGLSAQNWLQHVSVRDDWWEAQVSLAMLTLMFSILFTLFPRKMEGYKDFEELEYNCILAPIGRMLRNKALMLQVAALSILNTAVFGYVNFDTASIQAKFFVETLRQDPRTVRTIMDIFRSLVIIFFVSIFRMRFSGRRSDGVKSNTASRVGGAVCVLVAAFFAVLAGLHCNTGELAGFGGLTEEYEQPSCSAQCGCGSEKYGFSPVCILNTSTTYFSPCHAGCREYEDLGGFLLFSECACGSGRAVRGSCNLASCWLPYSLYLVFFTLMLASSAASFLMQGMAILRAVPRRDKPIAIGVAFSIVGLTAHGLGHLLYMVIGYLTCGYSDGETCLLHDYSIWIVGAASAVLAVLSGAISILASRCSNSNSG-