Monarch geneset OGS2.0

DPOGS214017
TranscriptDPOGS214017-TA1494 bp
ProteinDPOGS214017-PA453 aa
Genomic positionDPSCF300540 - 6565-11828
RNAseq coverage117x (Rank: top 58%)
Annotation
HeliconiusHMEL0061736e-16169.87% 
BombyxBGIBMGA010699-TA5e-10072.24% 
DrosophilaCG11655-PA3e-8746.32% 
EBI UniRef50UniRef50_B0W6D59e-11049.40%Sodium-bile acid cotransporter n=7 Tax=Pancrustacea RepID=B0W6D5_CULQU
NCBI RefSeqXP_001662576.19e-11350.12%sodium-bile acid cotransporter [Aedes aegypti]
NCBI nr blastpgi|1571324662e-11150.12%sodium-bile acid cotransporter [Aedes aegypti]
NCBI nr blastxgi|910908941e-11349.77%PREDICTED: similar to sodium-bile acid cotransporter [Tribolium castaneum]
Group
Gene OntologyGO:00160203.4e-117membrane
GO:00085083.4e-117bile acid:sodium symporter activity
GO:00068143.4e-117sodium ion transport
KEGG pathway 
InterPro domain[119-440] IPR0026573.4e-117Bile acid:sodium symporter
Orthology groupMCL16545 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214017-TA
ATGTGTCCATTATGGCCGCTGCACCTGATAGTGTTGTATCTCCTGGTGCTAGGCCCTATGTGGGTTCTATGTCAGGCGGCACCGAATCTCATGGCCACGTACCTGCCAACCGAAGTCGAGGAAGTGCACATGGGGGACACGTATTACGTCGATGTAAACGTTACAGGTGTAGGTCTTCGTCCAGGTGCTCGTCTCCAGGTGAACGTTAGAGACGAACACGTGGCGGACACTAAATGGAATTCCTCATATCAGGTCACAGAAAATGACGTCAGTGAGGGGAAGTTTAAAGGGAGGTTGAGGATTATAGGGAATTTTCTTGGAAGGACGATATTATCATTGGAGTCCCATGGCGTTGGGGACACCATAGAACCCGTGAATGGTACGCTAGCAGTCACCGTTACCAGACCCCAGAGAGTCATAGACACTATATTCACTACTAGTATAGCGATATTCATATCAATTGTGTTCATAAACTTTGGTTGTGCGATGCACTGGGATGAAGTTAAAGGGGTTGTGAGAAGACCTGTCGGACCTATCATAGGTCTCTGTGGACAGTTCGTGTTTATGCCATTGATATCCTTCGGTCTTGGTTACCTGATCTTCCCCTCATCTCCATCTCTCCACCTGGGTATGTTCTGCACGGGTGTAGCGCCGGGTGGCGGTGCCTCTAACATATGGACCTTCATATTGGGAGGGAATCTGGATTTGAGCCTCACAATGACATCCATATCAACCTTGGCTGCGTTCGGTTTCATGCCGCTGTGGCTGTTCACGCTCGGTCAAGTGGTGTTCGCTAACGCCAGTATAGTGGTTCCGTACAGTCGGATAGCTATGTTCGTGGTGGGTCTGATAGTCCCCCTCATCATCGGCCTGGCTATGCAGAAATTCACCCCTCGACTATCAGCCTTCATGGTCCGGATATTGAAGCCTTTCTCGTCTTGTATATTGATTTTCATTATAGTGTTCGCGATTGTCACCAATTTATACATATTCGAACTGTTCTCGTGGCAGATACTACTAGCTGGTATGGGTATCCCGTGGCTGGGATACATATCGGGATACCTGGTAGCCTGGCTATTCCGTCAACCTCATCCGGATGCACTGGCTATATCGATAGAAACGGGCATACAAAACACTGGCATCGCTATATTCCTACTGAGATACGCTCTGCCACAACCGGAAGCCGATATAACAACCGTGGTACCCGTTGCCTGTGCCATAATGACACCAATCCCGATGACAGCAATATTCATATATCAAAAATTAAGTTCATGCATCAAAAACAGAACACAACAGAAGAAAGATGTCGACCGCCCTGAGACCGTCGAGTCCGGCATCGAACCTGCCATTAATGGAAAATAAGGTGAACACACACAAGCACGCACACACACGCTCACACACACACGACATATATATAATCTTTTTTTGTCTATCTGTTTTACTGCGACTTTGGGATTTAAAAAAAATGCTAAATATATTACTTTTATGCAATAA

Protein sequence:

>DPOGS214017-PA
MCPLWPLHLIVLYLLVLGPMWVLCQAAPNLMATYLPTEVEEVHMGDTYYVDVNVTGVGLRPGARLQVNVRDEHVADTKWNSSYQVTENDVSEGKFKGRLRIIGNFLGRTILSLESHGVGDTIEPVNGTLAVTVTRPQRVIDTIFTTSIAIFISIVFINFGCAMHWDEVKGVVRRPVGPIIGLCGQFVFMPLISFGLGYLIFPSSPSLHLGMFCTGVAPGGGASNIWTFILGGNLDLSLTMTSISTLAAFGFMPLWLFTLGQVVFANASIVVPYSRIAMFVVGLIVPLIIGLAMQKFTPRLSAFMVRILKPFSSCILIFIIVFAIVTNLYIFELFSWQILLAGMGIPWLGYISGYLVAWLFRQPHPDALAISIETGIQNTGIAIFLLRYALPQPEADITTVVPVACAIMTPIPMTAIFIYQKLSSCIKNRTQQKKDVDRPETVESGIEPAINGK-