Monarch geneset OGS2.0

DPOGS202187
Transcript: DPOGS202187-TA (2178 bp)
Protein: DPOGS202187-PA (725 aa)
Genomic position: DPSCF300149 - 562206-590436
RNAseq coverage: 287x (Rank: top 38%)
Annotation
Database        Hit               E-value  Identity  Description
Heliconius      HMEL009174        0.0      75.37%
Bombyx          BGIBMGA013485-TA  0.0      71.23%
Drosophila      Oatp26F-PA        0.0      54.96%
EBI UniRef50    UniRef50_B4KJT1   0.0      56.85%    GI24249 n=3 Tax=Arthropoda RepID=B4KJT1_DROMO
NCBI RefSeq     XP_001660406.1    0.0      55.60%    organic anion transporter [Aedes aegypti]
NCBI nr blastp  gi|383863093      0.0      58.21%    PREDICTED: solute carrier organic anion transporter family member 4A1-like [Megachile rotundata]
NCBI nr blastx  gi|383863093      0.0      58.21%    PREDICTED: solute carrier organic anion transporter family member 4A1-like [Megachile rotundata]
Group
Gene Ontology:
  GO:0016020  0  membrane
  GO:0006810  0  transport
  GO:0005215  0  transporter activity
KEGG pathway:
InterPro domain:
  [2-676]   IPR004156  0        Organic anion transporter polypeptide OATP
  [55-680]  IPR016196  1.2e-39  Major facilitator superfamily domain, general substrate transporter
Orthology group: MCL12892 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202187-TA
ATGCGCGACATATTTACTTGGCGAAATGCTCTCGTTTTGTTCACTGAAGTCGCAAAGGGTCAGTGCACAATGACTGTGAGCGACTCCGGGGGCTGCGTCAACAAGGCCTTCGATGCTGTAGACGATGGCTTCCACACCATCGATCTGCGGACCAGGAGGGAGGACGGCCCACAGGCGGACAGAGCTGATCGCGAGTCTCGTTGCGGCTGGGGTGCACTGCGGCCGGCTTGGTTGCAGCGCTTTAGAACTGCTAAATGGGCGTTGTTCTGGCTCTGCTGGGCAGGCGCTATACAGGGCATGGTGGTGAATGGGTTCGTGAATGTGGTGATCACCACCATCGAGAGGAGGTTCGGCCTCCGCTCCATGCAGACAGGAGTCATCGCTGGAGGGTATGATATGGCATCGTTTCTCTGCCTGGCTCCGGTGACGTATCTGGGCGGACGGACCGCCGCGTCAAAGCCTCGCTGGCTCGGCTGGGGCGTGCTGTTGATGGGTGTGGGCTCCCTGCTGTTCGCGATGCCGCACTTCCTCGTGCCGCAGTACAAAGTTGCCGGCGAAGAGGAAGACGATCTCTGCAGAGTAAATAGGACACTTTCGAGCGCTCCCTGCTCTATCTCCGGCGAGGGCTCGTGGGCCGTGGCAGTGTTCGTGGCTGCCCAGCTTCTGCACGGAGCTGGGGCCACGCCTCTCTTCACTCTCGGAGTCACCTACATAGACGAGAACGTCTCTAAGAAGATGTCTTCCGTTTATTTGGGTGTTTATTACACTATGGCCGTGGTGGGTCCGGCGCTGGGGTACGTCGTGGGAGGACAAATGCTGCAATTATATACGGACTTTTTGACCGTGGATTCCGAGACGCTGGGTATAACTCCCGTGAGCTCTGTGTGGATCGGTGCGTGGTGGGTGGGGTTCATCCTCTCGGCTGTTCTCTGCCTGGTGGTCGCGGTGCCACTCCTCGCCTTTCCATACGAGTTGCCGGGTGCGGATGAAATAAAAGCGTCTAAAGTATCCGAAGCCCACGAGAACGCGACCAAGTCAGCAGCTTTCACAGCTCTCCGCGAGTTGCCGTGCGCTGCTGTAGAGTTGTTTAGGAATCCGACCTTCATGTTCCTGAACCTGGCCGGCGCCAGCGAGGGTATGCTGATCTCTGGGTTCGCGGCCTTCCTGCCGAAGCTCATAGAGAACCAGTTCGGTGTGAGCGCGTCGCAGGCGGCTCTATTACTAGGTGTCATCACGGTGCCGGCGGGCGGGGGCGGCACGTTTCTGGGCGGCTGGCTCGTGAAGCGCTGGCGCCTGGCGTGTGCGGGTATCATCAAGCTGTGCGTGGCCTCGACCCTGCTGGCGGCCGTGTTCTGCTTCTCCTTCGTCCTCAGCTGTGATGACTCTCCCTTCTCGGGAGTCACCGTCCCTTACGACAGCCCCTCTGTTCCCGGCGGAGACGGTCTGTTGGCTCAGTGCAACGCCGCGTGCGGTTGCTCTGACCTGGCCGGGGTCTGCGGGGCCGACGGGAAGGCCTACGGGTCCCCGTGCCACGCGGGCTGCACCAAGGCCATCAGGCAAGGCCCCGTCACCCTGTACACGGGCTGCGCCTGCATACCGCGGACCATCACCCTGCCGCTGTACCACGACACGGAGAGCAACACGTCGGCCCCGTACGAGGCCATCAACACCGCGTGCGCGTCCGAGTGTCCCTACCTGTGGCTGTTCGTGTTCCTCTCCTTCTGCGTCATGTTGTTCACGTTCGTGGCCACGATGCCGGCACTCTCCGCCACGCTGAGGTGTGTCAGGGAGGAGCAGCGGTCGTTCGCGTTGGGCGTCCAGTGGATCCTGGTCCGTCTGCTAGGTACGATCCCCGCGCCGCTGCTGTTCGGGTTCCTCATCGACCTGGCCTGCCGCCTCTGGTCCGCGGGCGCCTGCCGCCTCTACGACAACCTCTACATGAGCAGATACATGTTGGCCCTAGCGTTGGTGGGTAAGCTTTGCTCGCTTCTGTTCTTCTT
CTTCGCGTGGTGGTTCTATAGACCACCCGCGGCCAAAAACAACAACGCGGTGTCTCCGGCCGCGGACCTCGTGCTCCACGACAAGAGCAACGGAACCCTCAGCACCATCGCCAACGGGACCCTGGACAGGAACAACGGATATTGCAACGCAGCCCTGGAGTGTAGCGACCATATCTGA

Protein sequence:

>DPOGS202187-PA
MRDIFTWRNALVLFTEVAKGQCTMTVSDSGGCVNKAFDAVDDGFHTIDLRTRREDGPQADRADRESRCGWGALRPAWLQRFRTAKWALFWLCWAGAIQGMVVNGFVNVVITTIERRFGLRSMQTGVIAGGYDMASFLCLAPVTYLGGRTAASKPRWLGWGVLLMGVGSLLFAMPHFLVPQYKVAGEEEDDLCRVNRTLSSAPCSISGEGSWAVAVFVAAQLLHGAGATPLFTLGVTYIDENVSKKMSSVYLGVYYTMAVVGPALGYVVGGQMLQLYTDFLTVDSETLGITPVSSVWIGAWWVGFILSAVLCLVVAVPLLAFPYELPGADEIKASKVSEAHENATKSAAFTALRELPCAAVELFRNPTFMFLNLAGASEGMLISGFAAFLPKLIENQFGVSASQAALLLGVITVPAGGGGTFLGGWLVKRWRLACAGIIKLCVASTLLAAVFCFSFVLSCDDSPFSGVTVPYDSPSVPGGDGLLAQCNAACGCSDLAGVCGADGKAYGSPCHAGCTKAIRQGPVTLYTGCACIPRTITLPLYHDTESNTSAPYEAINTACASECPYLWLFVFLSFCVMLFTFVATMPALSATLRCVREEQRSFALGVQWILVRLLGTIPAPLLFGFLIDLACRLWSAGACRLYDNLYMSRYMLALALVGKLCSLLFFFFAWWFYRPPAAKNNNAVSPAADLVLHDKSNGTLSTIANGTLDRNNGYCNAALECSDHI-
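
The reported lengths of this record can be cross-checked against each other. The sketch below is a minimal illustration, not part of the database: it assumes the transcript consists only of coding sequence (it starts with ATG and ends with TGA) and that the trailing "-" in the protein sequence marks the stop codon rather than a residue. The function names `read_fasta` and `cds_matches_protein` are hypothetical helpers introduced here.

```python
def read_fasta(text):
    """Parse FASTA text into {header: sequence}, joining wrapped sequence lines."""
    records = {}
    header = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue  # skip blank lines between records
        if line.startswith(">"):
            header = line[1:]
            records[header] = ""
        elif header is not None:
            records[header] += line
    return records


def cds_matches_protein(cds_len, protein_len):
    """True if a CDS of cds_len bases encodes protein_len residues plus one stop codon.

    Assumption: the sequence is pure coding sequence with no UTRs.
    """
    return cds_len % 3 == 0 and cds_len // 3 == protein_len + 1


# Lengths reported in the record: 2178 bp transcript, 725 aa protein.
# 2178 / 3 = 726 codons = 725 residues + 1 stop codon.
print(cds_matches_protein(2178, 725))  # prints True
```

To run the same check on the sequences themselves, pass the two FASTA blocks through `read_fasta`, strip the trailing "-" from the protein, and compare the resulting lengths with `cds_matches_protein`.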