Monarch geneset OGS2.0

DPOGS215607
TranscriptDPOGS215607-TA2826 bp
ProteinDPOGS215607-PA941 aa
Genomic positionDPSCF300041 - 2288236-2298128
RNAseq coverage324x (Rank: top 35%)
Annotation
HeliconiusHMEL0058150.075.48% 
BombyxBGIBMGA003667-TA0.075.44% 
DrosophilaOatp30B-PB1e-15958.23% 
EBI UniRef50UniRef50_B4JPT70.049.38%GH13584 n=4 Tax=Coelomata RepID=B4JPT7_DROGR
NCBI RefSeqXP_002078778.10.054.53%GD23610 [Drosophila simulans]
NCBI nr blastpgi|1955778430.054.53%GD23610 [Drosophila simulans]
NCBI nr blastxgi|3071994140.057.63%Solute carrier organic anion transporter family member 5A1 [Harpegnathos saltator]
Group
Gene OntologyGO:00160207.7e-134membrane
GO:00068107.7e-134transport
GO:00052157.7e-134transporter activity
KEGG pathway 
InterPro domain[23-700] IPR0041560Organic anion transporter polypeptide OATP
[79-468] IPR0161966.9e-30Major facilitator superfamily domain, general substrate transporter
[488-518] IPR0114974.6e-08Protease inhibitor, Kazal-type
Orthology groupMCL13427 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215607-TA
ATGACAGAGGGAGAGAAACAGCAGAGCGCTGGCGGCGCCAATGCCGCACCACAACACAGGAAGGGGCACAGGAGACAAGAGTCCATGTACGCTATGACGGGTCTGTATGCGGAGTCTGGCTGCGCGGAAGGCGACGCCGGGCCGAGCGTACCCCCGCCAGCACACCCTCGAGATACCACCAAATGTCACAGCAGAAACCCCTCTGCTGGCATATGCGACAGAGACCGCGAGCGGGAAAAAGAGAAAGAGAAACCCCACCAAATGTTCCCAGAAATTTTAGACATTCCTCATGACTCAAGAGACTGCGGTATTCTATCGTGGAGACCGCTATTTATTCAGAGATTTTCTAGTATTAAAGTAGTAACGCTGCAGCAAGCGTTAAGCTCGGGCTACATAAACTCTGTGATTACAACCATCGAGAAACGTTTCGAGATACCCTCCAGCCTCTCGGGGCTCATAGCGAGCAGTTACGAGATTGGCAACGTCATCACCGTCATATTCGTTTCATATCTCGGCAGTAGACGACACATTCCAGTATGGATAGCAGTCGGTGCCGTCATTATGGGCATCGGCTCGTTGGTTTTCGTCGTGCCGCATTTCATAGCGGAAGCTAACAGCGAAATGATGATGAATAATAAGTCAGATGAGAACATTTGTCGACTGCCGCGAGCCTTGGAACAGGACATGAGCGGATTGGGGAGATTGTCCCCTGGTTTGCCGCCAAGCAATCTAAGGCCAGAAAACTGTATCAAGAGTTCGCCCAGCACGTTTGTGCCAGTGATGGTATTCGTGGTAGCTCAACTGCTGCTGGGTTGCGGCGGCTCCCCACTACTGACCCTCGGTACAACGTACGTGGATGATCACGTGCGACCAGAATCATCCAGCATGTACATCGGATGTATGTATAGTATGGCTGCTTTCGGTCCCGTGCTGGGATTTCTTCTTGGCGCCTACCTCCTTCAATTCCACATGGACTCGTTTTCCGGCACTATCATCCCATTAGATCCCGGCGACCATAGATGGGTGGGTATGTGGTGGGGAGGCTTCCTACTCTGTGGTCTTCTTCTCATCCTCGTGGCGATTCCTTTCTTCTCGTTCCCGAAAGTTTTGGTTCGCGAGAAAGAGAAGATTCGTCTCGTCGAGAAAGCAGCCGCTGCGAGCGGTTCCTCGACATCCAAACCGCCGCCGAAACCACAGTCAAATATCAAAGACACTGGATATGGTAAAGATATCAAAGACATTCCCGTATCCATGTGGCGGCTCCTTAAGAACCCCGTGTACGTGGTGACGTGTCTCGGAGCTTGTATGGAACTCATGATAGTGTCCGGCTTCGTGGTGTTCCTACCTAAATACCTAGAAACACAGTTCAGTCTCGGCAAGAGCCAGGCGAGCGTTTTTACCGGTGGCAACGTAGAGCCGTTCAAGGTCAACCTGACGGCCGCCTGTAACTTCAACTGTCTGTGTACGGAGACCGACATGGAACCTGTCTGCGGTAACAATGGCCTCACGTACTTCTCACCGTGTCATGCTGGGTGTGCCGCCTTTTCTTCCAGATCCAACTTCACTAACTGTGCCTGTGTCCACGAGAACAGTCGCGACATGCTAGGTGTGGGAGTGGGACTCGTGAGTGGTGCGTCCGCGTCTGCTCTGACCGCGGGGGCGGCCCGTGAGTACAGCGAGGTGACGGTGGTGCCAGTGGCCACGGCAGGTGCCTGTAACCCACCCTGTACCACCATCTTCCCGTTCCTCGTGCTGCTGTTCTTCATGACTTTCGTGGTAGCGGTCACACAGATGCCGCTACTCATGATAGTTTTGAGGTCGGTGAGTGAGGAGGAGCGTTCTTTTGCTCTGGGCATGCAGTTTGTAATATTCCGTCTATTCGGCTACATCCCAGCACCCATCGTCTTCGGCAACCTCATCGACTCGACCTGCATATTGTGGAAACAATCATGTAACGGTGAACAAGGAGGAAGGTGCCTGTTGTATGACATAGAGCAATTTCGATACAGATACGTCGGCCTCTGCGGTGGCATAAAAATAGTCGCATTAGGTATATTTCTAGCGGACTGGTGGCTGGTGAGAAGACGGAGACATCTAGAAACATCAGCTCCATTAGATCCTCAAAAGGACATTGCCGGTTCTATCATTAGTCTAGACAAATTGTTCGAGGAGTTACCGTCTGCGGAGAACGCGAGCGGCTTCAGGTCGGGAATAACTTCCGGTCTCAGTTCCGCCATCAGCACACCTAACGATCCCCTCGACCCGCACGAAGCTCAGGACCAGCGTCGTCTGCAGCGCACCGACTCTCAGTACTCGCAGGAGTCCCAGAGTCGCGCCAATTCTCGGGTGCTGGTAGCGTCGCGTCACCTCCGCAACGACTCCAAGACCATTCAACTGGAACCGCGAGCTCGACATCACATCGAGGAGCGACCTCGGGACTTTCCTCGCTCCACCTCGCGGGACTTTTCAGCTCACTCTCATTCAAGATCCACTTCGCGCGACTTCAAACCGCACTCGCGCTCAGATTCCCGAGATCTCGGACTCGAACAGCTGAAACAGCTCGCCCTCAAAAGCATGGACAGTCTCGACTTAACAGTGCTACCGCTCGCCAAATGTGCGGACGAGGAGAGCAAACGGCTCATAGAGGGCGCCGGTGTGCTGCGGCATAGACGAACCGGTTCCAGAGACCTCAAACCTAGCGAAAGTAAACACAAGCGAACCTCATCACACCACATCACTATGGAACCCGGTGATCTCAGTCTGCAGATACAGAAAGGGCGAAGCGTCGATCAGCTGGCGACCACCCAGATAGACCCGCGGGTGTAA

Protein sequence:

>DPOGS215607-PA
MTEGEKQQSAGGANAAPQHRKGHRRQESMYAMTGLYAESGCAEGDAGPSVPPPAHPRDTTKCHSRNPSAGICDRDREREKEKEKPHQMFPEILDIPHDSRDCGILSWRPLFIQRFSSIKVVTLQQALSSGYINSVITTIEKRFEIPSSLSGLIASSYEIGNVITVIFVSYLGSRRHIPVWIAVGAVIMGIGSLVFVVPHFIAEANSEMMMNNKSDENICRLPRALEQDMSGLGRLSPGLPPSNLRPENCIKSSPSTFVPVMVFVVAQLLLGCGGSPLLTLGTTYVDDHVRPESSSMYIGCMYSMAAFGPVLGFLLGAYLLQFHMDSFSGTIIPLDPGDHRWVGMWWGGFLLCGLLLILVAIPFFSFPKVLVREKEKIRLVEKAAAASGSSTSKPPPKPQSNIKDTGYGKDIKDIPVSMWRLLKNPVYVVTCLGACMELMIVSGFVVFLPKYLETQFSLGKSQASVFTGGNVEPFKVNLTAACNFNCLCTETDMEPVCGNNGLTYFSPCHAGCAAFSSRSNFTNCACVHENSRDMLGVGVGLVSGASASALTAGAAREYSEVTVVPVATAGACNPPCTTIFPFLVLLFFMTFVVAVTQMPLLMIVLRSVSEEERSFALGMQFVIFRLFGYIPAPIVFGNLIDSTCILWKQSCNGEQGGRCLLYDIEQFRYRYVGLCGGIKIVALGIFLADWWLVRRRRHLETSAPLDPQKDIAGSIISLDKLFEELPSAENASGFRSGITSGLSSAISTPNDPLDPHEAQDQRRLQRTDSQYSQESQSRANSRVLVASRHLRNDSKTIQLEPRARHHIEERPRDFPRSTSRDFSAHSHSRSTSRDFKPHSRSDSRDLGLEQLKQLALKSMDSLDLTVLPLAKCADEESKRLIEGAGVLRHRRTGSRDLKPSESKHKRTSSHHITMEPGDLSLQIQKGRSVDQLATTQIDPRV-