Monarch geneset OGS2.0

DPOGS208898
Transcript: DPOGS208898-TA | 1797 bp
Protein: DPOGS208898-PA | 598 aa
Genomic position: DPSCF300009 - 744496-752300
RNAseq coverage: 340x (Rank: top 34%)
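The lengths reported above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the transcript is a complete coding sequence that ends in a stop codon and that the genomic coordinates are 1-based and inclusive; neither assumption is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the record above; names are illustrative.
transcript_bp = 1797
protein_aa = 598
span_start, span_end = 744496, 752300

# A complete coding sequence should divide evenly into codons,
# with one codon more than the protein length (the stop codon).
codons, remainder = divmod(transcript_bp, 3)

# Assumption: coordinates are 1-based and inclusive.
genomic_span_bp = span_end - span_start + 1

print(codons, remainder)          # 599 0
print(codons - 1 == protein_aa)   # True
print(genomic_span_bp)            # 7805
```

Under these assumptions the transcript and protein lengths agree (599 codons, one of them the stop), and the gene spans 7805 bp of the scaffold, of which 1797 bp are present in the transcript.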
Annotation
Heliconius: HMEL015772 | 0.0 | 87.64%
Bombyx: BGIBMGA002471-TA | 0.0 | 76.33%
Drosophila: CG4484-PA | 1e-141 | 46.30%
EBI UniRef50: UniRef50_B4N968 | 0.0 | 54.00% | GK10931 n=3 Tax=Endopterygota RepID=B4N968_DROWI
NCBI RefSeq: XP_623536.2 | 0.0 | 57.85% | PREDICTED: similar to CG4484-PA isoform 1 [Apis mellifera]
NCBI nr blastp: gi|350407536 | 0.0 | 58.11% | PREDICTED: proton-associated sugar transporter A-like [Bombus impatiens]
NCBI nr blastx: gi|350407536 | 0.0 | 58.11% | PREDICTED: proton-associated sugar transporter A-like [Bombus impatiens]
Group
Gene Ontology: GO:0055085 | 2.7e-13 | transmembrane transport
               GO:0016021 | 2.7e-13 | integral to membrane
KEGG pathway:
InterPro domain: [68-596] IPR016196 | 3.5e-31 | Major facilitator superfamily domain, general substrate transporter
                 [219-518] IPR011701 | 2.7e-13 | Major facilitator superfamily
Orthology group: MCL11102 | Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208898-TA
ATGGTGGATAAATTGAACGAATATCAGGGCGTTACCGGTAGATTGCATTCTACTAGGGATAAATGTAAGGAGAAATGGTCTCAATGGAAGGAGGAACATCCTCAAGGTGTGCGAGGGACTGTCAGGGAGGCTCTGTTTGCTATACCTAATTCTGACGATGCTCCTGAAAGAGTTGTTCGGGAAGGTGAATACAGCGATCTGTTCAGGAGAAAAAATAGGCTCGAATTAACCCGAATTTCTGCAGCTGTGATGGGAATAGAATTTGCATATGCTGGAGAAACAGCATTCGTTTCTCCCACCCTTCTTCAAATTGGAGTGCCACATCAGCAGATGACTCTGGTATGGGCGCTTTCTCCATTAATAGGCTTCTTTATGACTCCACTTCTCGGTTCTCTTAGCGATAGATGCCAATCAAAATACGGAAGAAGGAGACCCTTCATTGTTTTGATGTCTATTGGAGTGTTTTTGGGTTTAATTCTTGTGCCAAATGGAGAAGATATTGGTTACGCTTTTGGAGACGAGGTCTTTGTTAACAAAACTGCTGTACCCTCGGTGTTGGGACCGCGAAGTTCAGTCCTTGAGGTGGAAGGGAATAATCATCATCCTTGGGGAGTGTTGTTCACAGTTTTAGGAACAGTGCTACTGGATTTCGATGCTGATGCATGCCAAAGCCCCGCACGAGCATATCTCCTCGATGTGACAGTTCCAGAGGATCACGCTAAAGGTCTGAGTACTTTTACCGTAATGGCCGGCCTTGGCGGTTTCATGGGATACGCCCTTGGTGGTATTAATTGGGACGAAACATCTCTTGGAGCGTTATTCGGTGGCCATGTCAGAGCTGTGTTCTTTCTAATCACTATAATTTTTATCGTATGCGTATCGGCTACTATAACCAGTTTTAAGGAAATACCGCTGTCAGAAATCAAAGAGACAGAAAACTATAATAAATTAAACGATAAAGACGAAGAAGAAAACCAGTTTGGAGAAGAACAAGATGGACTAAAGAAGGAAAATGCTTCGTATGGATCTTTAAATCAACCCGATCAACCAGCTGATGAAATTTCTCCTGATCCAAACCAACTGACTCTGACTATACCCGAAGGGCACGGCGAGCCTTTGTCTCTGAAGCACTACCTCAAATCCATTATACAGATGCCAAAATCTTTACGGGTCGTGTGTCTTACAAATCTCTTCTGCTGGATGGCTCACGTCTGCTACTCTTTGTATTTCACCGATTTCGTCGGAGAGTCTGTTTTCGGTGGAAATCCGGCTGCACCCGTGGGCAGTGAAAGTCGAACTAATTATGAAGCGGGTGTCCGATTTGGCTGCTGGGGGATGGCTATGTACTCTCTATCCTGTGCTTGTTATTCGACAATTATAGAAAAACTCATTAAAAAACTAGGGGCAAAAAAGGTGTACGTGGGGGGACTTTGCACGTACAGCTGTGGCATGTTTATGCTGTGTTTGCTGAGGGCGAGGGCGGCCGTCCTATTGTTCAGTTGGACAGCTGGCATCATGTACTCCACACTCTTCACCATGCCTTATCTACTGGTTGCACATTATCACGCCACTGGCATGTGGGACAGCGAAGGCGGAGGAAGTGGCCAGGAGCGTGGGATAGGCACAGATGTAGCGGTGGTCAGCAGCTGTGTGTTCGTGGCACAACTCCTCATCTCCGTCATCATGGGCTTCGCTGTCAAGGTCACAGGTTCTACAGCAGCCGTGGTCGCTGTGGCAGCCACGTTGGCGGCAACGGCCGCTTTTACTGCCACTAAGATTACCTATCTTGATCTGTAA

Protein sequence:

>DPOGS208898-PA
MVDKLNEYQGVTGRLHSTRDKCKEKWSQWKEEHPQGVRGTVREALFAIPNSDDAPERVVREGEYSDLFRRKNRLELTRISAAVMGIEFAYAGETAFVSPTLLQIGVPHQQMTLVWALSPLIGFFMTPLLGSLSDRCQSKYGRRRPFIVLMSIGVFLGLILVPNGEDIGYAFGDEVFVNKTAVPSVLGPRSSVLEVEGNNHHPWGVLFTVLGTVLLDFDADACQSPARAYLLDVTVPEDHAKGLSTFTVMAGLGGFMGYALGGINWDETSLGALFGGHVRAVFFLITIIFIVCVSATITSFKEIPLSEIKETENYNKLNDKDEEENQFGEEQDGLKKENASYGSLNQPDQPADEISPDPNQLTLTIPEGHGEPLSLKHYLKSIIQMPKSLRVVCLTNLFCWMAHVCYSLYFTDFVGESVFGGNPAAPVGSESRTNYEAGVRFGCWGMAMYSLSCACYSTIIEKLIKKLGAKKVYVGGLCTYSCGMFMLCLLRARAAVLLFSWTAGIMYSTLFTMPYLLVAHYHATGMWDSEGGGSGQERGIGTDVAVVSSCVFVAQLLISVIMGFAVKVTGSTAAVVAVAATLAATAAFTATKITYLDL-
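
The relationship between the two sequences above can be illustrated by translating the ends of the transcript and comparing them with the ends of the protein. This is a minimal sketch, assuming the standard genetic code and reading frame 1; the `translate` helper and the `-` stop symbol (chosen to match the trailing `-` in the protein record) are illustrative, not part of the database. Only the first 30 and last 24 nucleotides of DPOGS208898-TA are used.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order ('*' marks a stop).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence in frame 1, ending at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            protein.append(stop_symbol)
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 24 nucleotides of DPOGS208898-TA, copied from the record.
cds_start = "ATGGTGGATAAATTGAACGAATATCAGGGC"
cds_end = "AAGATTACCTATCTTGATCTGTAA"

print(translate(cds_start))  # MVDKLNEYQG
print(translate(cds_end))    # KITYLDL-
```

Both results match the corresponding ends of DPOGS208898-PA, which is consistent with the trailing `-` in the protein record standing for the terminal TAA stop codon.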