Monarch geneset OGS2.0

DPOGS211259
TranscriptDPOGS211259-TA1734 bp
ProteinDPOGS211259-PA577 aa
Genomic positionDPSCF300425 + 37578-58276
RNAseq coverage978x (Rank: top 13%)
Annotation
HeliconiusHMEL0126953e-15972.53% 
BombyxBGIBMGA005437-TA0.066.13% 
Drosophilaspin-PD5e-11561.14% 
EBI UniRef50UniRef50_B3MF451e-16348.99%GF11295 n=8 Tax=Endopterygota RepID=B3MF45_DROAN
NCBI RefSeqXP_002050676.16e-17151.54%GJ22291 [Drosophila virilis]
NCBI nr blastpgi|1953839261e-16951.54%GJ22291 [Drosophila virilis]
NCBI nr blastxgi|1953839269e-16651.71%GJ22291 [Drosophila virilis]
Group
Gene OntologyGO:00550856e-39transmembrane transport
GO:00160216e-39integral to membrane
KEGG pathway 
InterPro domain[1-376] IPR0161961.1e-50Major facilitator superfamily domain, general substrate transporter
[55-377] IPR0117016e-39Major facilitator superfamily
Orthology groupMCL13085 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211259-TA
ATGGACCAAAACAGTATAACACCAAATACATCGAATCAGCAGTTAGTAATTAACGGAGATAATGAATCTGCAACTGCATTGTTGAACGAGAAGCAGGGTAGAAAGCGCGATAGTTTGAGAGATGTCACCTTCATCGAATTCATGACCGTCGGTATATTGTGTTATGTTAACCTTATCAACTACATGGACAGGTTCACCCTCGCCGGTGTATTAGGGGATGTCAAAGATGAATTTAACATTGGTGACGATTACGCCGGTTTCCTTCAGACCGTGTTTGTGATCGCTTACATGGTTTTCGCGCCAATATTCGGTTATCTCGGTGACAGATACTCCAGGCGGAGAATCATGGCATTTGGTGTGGCCCTGTGGAGTCTCACAACATTCGTGGGATCGTATATACCTGATTTCGCGTGGTTCGCGGTCTTCCGCGGCCTGGTTGGTATCGGTGAGGCGTCATACTCCACCATCGCACCGACCATTATCAGTGACCTGTTCGTCGGTAACGTCCGATCGAAAATGCTTGCGCTTTTCTATTTCGCGATACCAGTCGGGAGCGGTTTTGGCTATATAGTCGGTTCAGCGGCTGGTGCCGCTATGGGTAATTGGCGTTACGGGTTGCGTGTGACCCCGTTCCTCGGAGCCCTGGCCGTGGTGCTGATGCTGTGGGTCATGGAAAACCCTGAGCGTGGTCAGGCGGAGGAGAGCCGGATGAAACCGACTTCGTACCAGGAGGATCTCAAGTCGCTGATCAGGAATCCGTCATTCATGTTGTCGACTTTGGCTTTCACGTGCGTCGCGTTTGTGACGGGGGCGCTCGCCTGGTGGGGCCCGGACTTCATCAGGCTAGGGCTGACCCTGCAGACTGGACAGGAAGTCTCCATAGAAGGCGTATCATTCAAGTTCGGGCTGGTGGGTATGGCGGCGGGGGCGCTGGGCGTCCCTCTGGGGTCTCTGCTCGCCCAGCATATGCGGACCCGCACCCCCGCCGGCGACCCTCTGCTCTGCGGCTTCGCGCTGCTGGTCTCCTCGCCGCTGGTGTACCTCGCGCTGTTCTCCACCGCCCACTTGGCGGGCCTCAGTTACCTGCTCGTGTTCCTCGGCATGCTAACGCTCAACCTCACCTGGGTGTCGCTGGTGTTCGGAGCGCTGACCATGGCGTCGGGGCTGGTGGGTGTGCCGCTGGGTGCCTGGCTGGGCGCGGCGTTGATCGCTCGCTGGGGCCGCGCGCACGCCCTGCTGTGTGCCGCGGGGCTGCTGCTGTCCGCTCCCGCCATGACGCTCGCCATCTTCCTCACGGACAAGCACTACTACGCTCCGTTCGTGCTCATGTTCTTTGCCGAGCTCACGCTCAATCTCAACTGGGCTATCGTTGCTGACATGTCGCTGTACGTGGTGATACCACCAAGAAGATCAACGGCGGAGGCTTTCCAGATTTTGATCTCACATATGTTTGGGGATGCTGGCAGTCCCTATTTGGTTGGAGTTATATCCGAGAACTTGAAGAGATCGCTTTCACCTTTCGAAGAACCTAGCAACAGTGTCAAATTTCGATCGCTTCAGTACGCCTTGTTCATTACATGTTTCGTAGAGGTTATTGGAGGAATTTTCTTCCTACTGACGGCCATTTACATTGTGAGAGATAAACTTAGAGTTGAACGAGAAATCGCAGAAGCCGAGGCACAAAGTGCTGAACCATCTCACAGCAACGCACAAGAAAATCCCGGCGTTGAATAA

Protein sequence:

>DPOGS211259-PA
MDQNSITPNTSNQQLVINGDNESATALLNEKQGRKRDSLRDVTFIEFMTVGILCYVNLINYMDRFTLAGVLGDVKDEFNIGDDYAGFLQTVFVIAYMVFAPIFGYLGDRYSRRRIMAFGVALWSLTTFVGSYIPDFAWFAVFRGLVGIGEASYSTIAPTIISDLFVGNVRSKMLALFYFAIPVGSGFGYIVGSAAGAAMGNWRYGLRVTPFLGALAVVLMLWVMENPERGQAEESRMKPTSYQEDLKSLIRNPSFMLSTLAFTCVAFVTGALAWWGPDFIRLGLTLQTGQEVSIEGVSFKFGLVGMAAGALGVPLGSLLAQHMRTRTPAGDPLLCGFALLVSSPLVYLALFSTAHLAGLSYLLVFLGMLTLNLTWVSLVFGALTMASGLVGVPLGAWLGAALIARWGRAHALLCAAGLLLSAPAMTLAIFLTDKHYYAPFVLMFFAELTLNLNWAIVADMSLYVVIPPRRSTAEAFQILISHMFGDAGSPYLVGVISENLKRSLSPFEEPSNSVKFRSLQYALFITCFVEVIGGIFFLLTAIYIVRDKLRVEREIAEAEAQSAEPSHSNAQENPGVE-