Monarch geneset OGS2.0

DPOGS212675
TranscriptDPOGS212675-TA1395 bp
ProteinDPOGS212675-PA464 aa
Genomic positionDPSCF300198 + 199815-209020
RNAseq coverage17x (Rank: top 81%)
Annotation
HeliconiusHMEL0075102e-7935.38% 
BombyxBGIBMGA014055-TA1e-11545.68% 
DrosophilaCG10960-PB1e-4527.09% 
EBI UniRef50UniRef50_B4NLR65e-5029.01%GK18458 n=1 Tax=Drosophila willistoni RepID=B4NLR6_DROWI
NCBI RefSeqXP_002074319.18e-5129.01%GK18458 [Drosophila willistoni]
NCBI nr blastpgi|1954546052e-4929.01%GK18458 [Drosophila willistoni]
NCBI nr blastxgi|1947614523e-5229.52%GF14177 [Drosophila ananassae]
Group
Gene OntologyGO:00550851.9e-55transmembrane transport
GO:00160211.9e-55integral to membrane
GO:00228571.9e-55transmembrane transporter activity
KEGG pathway 
InterPro domain[18-457] IPR0058281.9e-55General substrate transporter
[1-456] IPR0161961.1e-51Major facilitator superfamily domain, general substrate transporter
Orthology groupMCL31066 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212675-TA
CTAGGGAATATGCTCACTGGCATGTTGTACGTATGGCCTTCCTACACTGTCGTACATTTCACAAAAACTGATACAGAATATCTCGACGCTCCAATGACTGAAATTGAAAGTTCCCTAGTAGGGAGTCTGCCATCTTTGGGAGCCATGGCTGGCACCGTACTGGTTGGATGGTTGATGAGTTTATTTGGGAGACAGAGGACAGGATTGATCCTAGCTTTTCCAATGTTGGCTTCCTGGTTGATAATAGAACTAACGCGATCATCCATGGTGATTCTAATAGCGAGATTTCTTAGTGGTGTGTCTGGAGGTGGTTTTCTCGTCCATTCACCCATCTACATATCTGAGGTGGCGGAACCATCGATACGAGGAACATTGGCTTCTGCTCCAATGATATTTTACTGCATCGGCATCTTGGTTTCCTACTTAATGGGTTGGTTTCTAACCTACCGGTACGTCATATGGGGTAACTTGATCTGTTGTATTGTGTACGCCGCCCTTATGATGACGGTTACTGAAAGTCCAGTGTATTTGCTTCGGAAAAACAGAGAAGAGGAAGCTCGTTTGGCTGTCAGTCATTACTTAGGTGGCTCTGTGAATTCTAAGGCCGTACTTGAGGAATTTTCTAGACTTAAGGAACAAATAGCGCCTTCGGTAGAACTAGTAGCAAGAGACCCCGATCAAAGTGAAAAACAAAAACTTAACGCTGAAATCAATGATGTCGAGCAACAAACGAAGGATGAAATGTCACCATTGAAAATGTTATTTGTTTCGCCAGCATCTCGACGCGCGTTCACAACTGTTTTCCTCATACTATCTCTACAAGTTATGATGGGTATGATAGCTGTGCAAGTGTACGCTAAGGATATCTTCACTCACGCGGCGCCTAATCTTTCTTCACATTTTTGTTCCGTTATGTTCGCGTCCACTCTCCTGCTTGGCAGCCTTATTAATGGATTACTGTTGGACAGATTCGGCAGAAAGATTCCATTGATCTCATCGTCTATTGGAACAGGTGTATGCCTTGTAGCTTTGGGTTTCTTTATTCAGACCAGTGTTGCTCCAGCGTGGGTGACAGCTTTAGTGATATTATTCTTTTGTTTTTTCTTCATGAGCGGTGCCGGCTCCGTGCCCTATGTTATGTTGGCTGAGATGTTTATATCAGAGGTTCAAAGTGTCGCTTCTATGATTCTCATGGAATGGGTGTGGCTTTTAAACTTCCTGCTGGTTGGTGTTTTTCCATTCATGATTAAGTTCCTCGGGGTCCACGGCGTCTTCTACAGCTTTGCAATGTTCGCATTGCTTGATTGCCTCGTTGCCATCTTCATAGTACCAGAGACCAAAGGACTCACCATCGATCAGATTCAAAAACTATTGCTTTACCGGCCATTGAAATGA

Protein sequence:

>DPOGS212675-PA
LGNMLTGMLYVWPSYTVVHFTKTDTEYLDAPMTEIESSLVGSLPSLGAMAGTVLVGWLMSLFGRQRTGLILAFPMLASWLIIELTRSSMVILIARFLSGVSGGGFLVHSPIYISEVAEPSIRGTLASAPMIFYCIGILVSYLMGWFLTYRYVIWGNLICCIVYAALMMTVTESPVYLLRKNREEEARLAVSHYLGGSVNSKAVLEEFSRLKEQIAPSVELVARDPDQSEKQKLNAEINDVEQQTKDEMSPLKMLFVSPASRRAFTTVFLILSLQVMMGMIAVQVYAKDIFTHAAPNLSSHFCSVMFASTLLLGSLINGLLLDRFGRKIPLISSSIGTGVCLVALGFFIQTSVAPAWVTALVILFFCFFFMSGAGSVPYVMLAEMFISEVQSVASMILMEWVWLLNFLLVGVFPFMIKFLGVHGVFYSFAMFALLDCLVAIFIVPETKGLTIDQIQKLLLYRPLK-