Monarch geneset OGS2.0

DPOGS201108
TranscriptDPOGS201108-TA1842 bp
ProteinDPOGS201108-PA613 aa
Genomic positionDPSCF300137 - 317566-391962
RNAseq coverage25x (Rank: top 77%)
Annotation
HeliconiusHMEL0110070.078.29% 
BombyxBGIBMGA014449-TA0.066.96% 
DrosophilaCG31100-PA3e-1621.26% 
EBI UniRef50UniRef50_A7S0E67e-2122.75%Predicted protein n=1 Tax=Nematostella vectensis RepID=A7S0E6_NEMVE
NCBI RefSeqXP_001658148.16e-2124.63%sugar transporter [Aedes aegypti]
NCBI nr blastpgi|1571152161e-1924.63%sugar transporter [Aedes aegypti]
NCBI nr blastxgi|1565502771e-1828.77%PREDICTED: facilitated trehalose transporter Tret1-like [Nasonia vitripennis]
Group
Gene OntologyGO:00550857.2e-17transmembrane transport
GO:00160217.2e-17integral to membrane
GO:00228577.2e-17transmembrane transporter activity
GO:00160203.6e-06membrane
GO:00228913.6e-06substrate-specific transmembrane transporter activity
KEGG pathway 
InterPro domain[13-596] IPR0161963.5e-33Major facilitator superfamily domain, general substrate transporter
[499-596] IPR0058287.2e-17General substrate transporter
[41-51] IPR0036633.6e-06Sugar/inositol transporter
Orthology groupMCL26198 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201108-TA
ATGATAGTGGGGGCTGTTCTTAACAAAAATCGCATTGAAAGGACCGCTTCTAAACGTTGGCGCGGCTCGATCCGTCGCCTGATAGCGGCCACTGTATACAATCTGTCCTGTTTCACACACGGCTGCAGTACTGGTTGGGTGTCAGGGGTTCTCGGCAACGAGGCTCTGACTGGCGGAGCCTGGCTGGCTGCCCTGCCATGCCTGGTAGCGCTGCCAGCCGCTCCGATGTTCGCTGTGCTGGCAGATAGCAAGGGACGGAAGGCCGGCGCTTTTGTTATCTGTATAAGTTTTATTATAAGCTGGTCGCTGGCCGCGTGGTTTGGGGGTCGCGGCGTGTGGGTGGCGCGAGTAGCAGCGGGTGCTGGAGGGGCCGGTGCCCTCGCCCTGGCCCCTCTGTACTGTGCGGAGATCGCACCAAGAACCAGAGGTTTGGCAGCCATGCCAGCACTCGCTTGCAGCTGTGGCATACTGTTCGCGTACGCAGCTGGCGGAGTGTTGTCAGCGCACGCCTTATCTCTATCGATGGCTGGTCCACCTGCTATACTACTGTTCTCACTCATCTGGCTGCCGGAGACACCTTCGTTTCTTATTAGTATTGGAAAAATTCAGGAAGCAGCAAAGATTATGTGCTGGTTTGACGGTTCAGACTTCAGAGAGGATCTCACTGACGTGATCGAGCAGCAAGAGGTCAAGATACGGATCTCAGAAGGTTACGGGAGGAAGGAGCTGCTGAGGAGACAGGACTCGGACACCTTCAGACCCATGCTGAAGAGAAGTAGCGGCCTCGAGTCTAACTCGGATAAAGAAAAGGACGACCAATCTGTGTGCAAAGAATTATGTGGGTTTGAGGAATATTTTGTATTTATAATTGGAATTGGATGGGTCCGTCGCTCATCATCACGCCGGGCGCTCCTCTCGTGCGTGGTTGTGGTGTCTGCTGCCGCGGGTTCCGGCGCCGTTGCTGTCAATAGCTTCGCAGCAGCGGTCGTCAGGCATTCCACGCACCGAGTACCCGTCCTTAACGACACGGCGTACAACTTCACTATATATAATGGGACAGTTCCAAGAGCTTTATTTGAGTCGTCTGAGGCTGGCTCTGTGCTGTGCGGAACAGCGCTCGTGCTCGGCGCAGCAGTGGCCACTGTTACCGTCGATAAAGTTGGCAGAAAGACGCTGCTTCTTCTGTCTTGTTCCGGAATCGCGTTCGGTTTGACAGTTCTTGGAATTTACTGTGATCCTCAATTACGGATGCACTCTCGCTATCTCCATAGAGTGTGGCCTTTGAGGAAGAGCACTTACAAGGAGAAGGTGATCCGAGACAAATCAGATATACCTTCGAATTACACGTTATACACCTCAACAGTTCCTCTGTTAAACGATAGCACAAAACCGTGGTATAGAATCAGTGCAGAAGATCACAATGTCACTGAATCTGATATGTCAGTGAAATACGAAGAAGCCGGTGACAAAGAAGAGGTCACGATATGGTTGCCAGTGGTTCTGCTGTCCATGGTCTTATTTCTGTACAACATCGGCCTGGGCTCTATACCTTATGTACTGATATCGGAGTTATTTTCCGTTCACGTCCGCAGTCTGGCTTCTAGCTTCCTGATCGCCTGGATGTGGATAAGTAACTTCCTGGTCCTTCGTTACTTTGGGACTATCGCCATCTCGCTCGGTCTGCACGCCACGTACTACATCTGCGCCTCCATCACACTCCTCGGCGCTGGATACATTTACTTAGTGATTCCCGAAACCAAAGGCAAGAGCCGGACCCAAATCACAGAAGCCTTGCAAGGACCCTGGCTCCTCTTTAAAAGGAAACGGAAGAACCAACGATGA

Protein sequence:

>DPOGS201108-PA
MIVGAVLNKNRIERTASKRWRGSIRRLIAATVYNLSCFTHGCSTGWVSGVLGNEALTGGAWLAALPCLVALPAAPMFAVLADSKGRKAGAFVICISFIISWSLAAWFGGRGVWVARVAAGAGGAGALALAPLYCAEIAPRTRGLAAMPALACSCGILFAYAAGGVLSAHALSLSMAGPPAILLFSLIWLPETPSFLISIGKIQEAAKIMCWFDGSDFREDLTDVIEQQEVKIRISEGYGRKELLRRQDSDTFRPMLKRSSGLESNSDKEKDDQSVCKELCGFEEYFVFIIGIGWVRRSSSRRALLSCVVVVSAAAGSGAVAVNSFAAAVVRHSTHRVPVLNDTAYNFTIYNGTVPRALFESSEAGSVLCGTALVLGAAVATVTVDKVGRKTLLLLSCSGIAFGLTVLGIYCDPQLRMHSRYLHRVWPLRKSTYKEKVIRDKSDIPSNYTLYTSTVPLLNDSTKPWYRISAEDHNVTESDMSVKYEEAGDKEEVTIWLPVVLLSMVLFLYNIGLGSIPYVLISELFSVHVRSLASSFLIAWMWISNFLVLRYFGTIAISLGLHATYYICASITLLGAGYIYLVIPETKGKSRTQITEALQGPWLLFKRKRKNQR-