Monarch geneset OGS2.0

DPOGS201544
TranscriptDPOGS201544-TA1971 bp
ProteinDPOGS201544-PA656 aa
Genomic positionDPSCF300006 + 1724711-1739241
RNAseq coverage902x (Rank: top 14%)
Annotation
HeliconiusHMEL0090563e-17679.80% 
BombyxBGIBMGA002723-TA3e-16474.74% 
DrosophilaOatp74D-PC1e-17447.85% 
EBI UniRef50UniRef50_Q9VVH92e-17247.85%GH24467p n=19 Tax=Endopterygota RepID=Q9VVH9_DROME
NCBI RefSeqXP_001353787.23e-17446.79%GA20447 [Drosophila pseudoobscura pseudoobscura]
NCBI nr blastpgi|1984658475e-17346.79%GA20447 [Drosophila pseudoobscura pseudoobscura]
NCBI nr blastxgi|1984658478e-17147.23%GA20447 [Drosophila pseudoobscura pseudoobscura]
Group
Gene OntologyGO:00160209e-193membrane
GO:00068109e-193transport
GO:00052159e-193transporter activity
KEGG pathway 
InterPro domain[38-639] IPR0041569e-193Organic anion transporter polypeptide OATP
[1-649] IPR0161966.6e-29Major facilitator superfamily domain, general substrate transporter
[444-481] IPR0114972.6e-06Protease inhibitor, Kazal-type
Orthology groupMCL16859 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201544-TA
ATGAAAGGCGTCGAGGCCACACCAGAGGAACAGGAACGGTTAGCCACCGGAAACAACAATGGGTCCCTCGATTGTAAATTACCCCCGCAGCAGGCTAGGCCGCGAACAGCTTTTCATTCGACTAGAATGTTCATGCTGGTGTTCCTATCGGGATGGGTGTTGCAAGGTATGTTCCTTACATACTTCGTGAGTGTCACCACGACGGTGGAGAAGTTGTTTAAAGTCGAATCGAAGACGACCGGAACGCTGTTGGCTGCTACGGAGATCGGTCAGATATGTACCGCGCTGTTCCTGACATACCTCGCCGGTCGCGGCCACAGACCGAGATGGATCGCTTGCATGATGCTAGTGCTGGCTGTGGGTGTGATCGGCTGTGTGGTCCCTCACATTATGTACGGAACCCGACTGATGGATGTACATTTAGATTCTCATAGAGGTGGTTCAGCGCCGGTTTGCTCCGCCGACGGTAATTCTTCGATGGTGACATGTGACGACGCTCACGCCAGGAGTACCGCCGCCCGGTCCTATATAACTTCGGTGGTGATACCTTGGCTGTTCGTCTGCTTGCTGGTGGTCGGCGTTGGACAGACCGGCATCGCTACACTCGGTATACCATACGTAGACGACAACGTCGGCAGTAAGCAATCACCACTGTACATGGGTATTACGATCGGTATAAGAGTGATCGGGCCGGCGCTAGGGTTCCTGCTGGGGGCTCTGTGTACCCGTGTGTACGTGGATCCCCTCAAAGACCCCGGGTATGAATACAGCGACCCCAGATGGGTCGGCGCCTGGTGGCTGGGTATGGTGTTCATAGCCGGTTTCATAGTTATACTTTCGACACTAATGTTCTTCTTCCCGCGTCAAATAAAACAAGGGCCCATGCAGTTGACGCTGAAAAAGAGCAAGGAAAAAAACGAACCATTCTTCAAAGATTTCTTCGTGACGATAAAACGCCAACTGACAAACGACATCCTCATGTGGCGCACGGCCTCGTCGGTGCTCCACCTGATGCCCATCAGCGGCATATACTCCTTCCTGCCCAAATACCTGGAGAAGCAGTTCCGTCTGCCAACGCACGACGCCAATCTTGTCTCGGGTCTTGGTGGTATCTTAGTGATGGGTTTGGGCACCATCACGTCGGGAGTGATCATCCTTAAACTGGTGCCCACAGCAAGACAGGTGGCCGCCTGGACAGCTGCTACAGCTGCTATATATAGTGCTGGTATGGTCGTCCTAATGTTCGTTTCCTGTCCCGAGGAGACGTTCCGTTCTTTGGACGCTACCAACTTGTCTTGCAGCACGGACTGTCACTGTTACGGAGCCCCCTACTCGCCCGTATGTGGACAGGACTCGTTTACTTATCAGACGCCATGTCAGGCCGGATGTGCCAACAGCCAACAACTAGACAATAGCACCTGGCTATACTACAACTGTTCGTGCATCAACGGGACCCTGCTCGGCGAGAAGGCCGTTCAGACCCTGGAGCGCTACTCCGTGCCCGACGGCTTCTCGTCCAGCTCGTTCGCGGTGTCCGGCGAGTGCGGCGGCGCGTGTAACCAGATCTACGTGTTCATAGCGATCTTCGCCGCTATAATGTTCGTCCACGCCACGGGCGAGGTCGGCGCTGTGCTCATCATCATACGCTGCACCGACAAACAGGACAAGGCGATGGCGATGGGTGTGATACAGTTCGCGATCGGCGTGTTCGGTAACGTGCCCTGTCCGATCATATTCGGGGCGGCCATAGACGCCGCCTGCCGCCTCCGAGACGCCGCGTGCGGACTCATAGGCGCCTGCGCCAGCTACGACAGTGACAGATTCAGACATTTCTTCCTAGGTTTGTCCGCCGGTTTGATGTTCCTTGCCTTCATAATGGATATGTTGGTGTGGCTCCGCGCTAGCCGTATCGACATGAACCCCGCCGACCGCAGCTCAGACCGCTCGCCGGCCGGGGACACGTCGCTCTGA

Protein sequence:

>DPOGS201544-PA
MKGVEATPEEQERLATGNNNGSLDCKLPPQQARPRTAFHSTRMFMLVFLSGWVLQGMFLTYFVSVTTTVEKLFKVESKTTGTLLAATEIGQICTALFLTYLAGRGHRPRWIACMMLVLAVGVIGCVVPHIMYGTRLMDVHLDSHRGGSAPVCSADGNSSMVTCDDAHARSTAARSYITSVVIPWLFVCLLVVGVGQTGIATLGIPYVDDNVGSKQSPLYMGITIGIRVIGPALGFLLGALCTRVYVDPLKDPGYEYSDPRWVGAWWLGMVFIAGFIVILSTLMFFFPRQIKQGPMQLTLKKSKEKNEPFFKDFFVTIKRQLTNDILMWRTASSVLHLMPISGIYSFLPKYLEKQFRLPTHDANLVSGLGGILVMGLGTITSGVIILKLVPTARQVAAWTAATAAIYSAGMVVLMFVSCPEETFRSLDATNLSCSTDCHCYGAPYSPVCGQDSFTYQTPCQAGCANSQQLDNSTWLYYNCSCINGTLLGEKAVQTLERYSVPDGFSSSSFAVSGECGGACNQIYVFIAIFAAIMFVHATGEVGAVLIIIRCTDKQDKAMAMGVIQFAIGVFGNVPCPIIFGAAIDAACRLRDAACGLIGACASYDSDRFRHFFLGLSAGLMFLAFIMDMLVWLRASRIDMNPADRSSDRSPAGDTSL-