Monarch geneset OGS2.0

DPOGS204749
TranscriptDPOGS204749-TA1578 bp
ProteinDPOGS204749-PA525 aa
Genomic positionDPSCF300231 - 390061-392995
RNAseq coverage185x (Rank: top 49%)
Annotation
HeliconiusHMEL0104571e-12547.85% 
BombyxBGIBMGA013708-TA9e-10839.81% 
DrosophilaCG7458-PA2e-7630.06% 
EBI UniRef50UniRef50_D2CG574e-8434.64%Putative uncharacterized protein GLEAN_10997 n=1 Tax=Tribolium castaneum RepID=D2CG57_TRICA
NCBI RefSeqXP_970562.18e-8534.64%PREDICTED: similar to AGAP012383-PA [Tribolium castaneum]
NCBI nr blastpgi|910947212e-8334.64%PREDICTED: similar to AGAP012383-PA [Tribolium castaneum]
NCBI nr blastxgi|3323751101e-8336.19%unknown [Dendroctonus ponderosae]
Group
Gene OntologyGO:00550851.6e-14transmembrane transport
GO:00160211.6e-14integral to membrane
GO:00228571.6e-14transmembrane transporter activity
KEGG pathway 
InterPro domain[133-494] IPR0161962.9e-30Major facilitator superfamily domain, general substrate transporter
[128-487] IPR0058281.6e-14General substrate transporter
Orthology group 
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204749-TA
ATGGTAGCACTCGATGAGAAAAGTGTGCATTTAGATGAGGTGTTGAAAAAATTTGGTGCTATTAATCGCTATCACGTTCAAGCGATTATTCTGATCAGTTTTGCTTTTTTCAACAATGGCATGCATTGTATTAATTACGTTATAGTTGCAGAGGAAACACCATACAGATGCAAAATAGAACCGTGTGAAATGCAAAGCAATGTTTTCGAGTCTTCGTGGTACAATTTCAGTGTTCCGCCAGACTCACGAGGATGTAAGAGATACCGATTCGATCTCAAAGAATGTGATCCATTCAGCTTCAACACATCAATTGTAGAAACTTGTGATGAATGGGTTTACAAGAAAAACGATAGTTTTGTCGCCGAGTTTCAGTTAGCGTGTCAGAATTGGAAGAGAACTTTTGTTGGAACCGTTCATAGTATTGGACTCATGTGTGGATTATTTTTTCAAGGACAGCTATCCGATAAGATTGGACGGAAAGCCGCAACAGTAATATCAGGTTTAGCTGCCGGAGTGCTAGGCATTGCAAAGAGTTTTTCAAATACCTATACATTATACGTAATTTTAGAGTGGCTCGAAGCCACTATAGGAGATAACTGTTCACCTGCTTTTATTTTAGCCGTTGAACTGGTTCACAGCGATTACAGATTGCACCAACAAATATTTTTGTGCGTCATTGCTGCCATGGGTGGAGTAACATTTGCCATTGCAGCATATCTAGTGCCATACTGGAGAGACTTTGTTCGAGTTATTTACGCTCCAGCTTTCCTGTACATCATAATATATTTCATCATGGATGAGAGCGTGCGATGGTTATTAAGTAAAGGAAAGAGGAAACAAGCAACAAAGTTATTATTAAAAATGGCGAAGATAAATAAAATCTCTCTGGATAGAAAAATGCTTCTTAATATACAGTGTGAAGAGGGGACAAAGAATTCTGCCTTGAGGGAGACATTCCGATCAAGGATTGTTGCGAAACGATTTTTAATATGTTTATTATGGTGGACTTCTTGTACGTTCATCTGTTTTGGCTTAATAGTGAATGTGGGGTCGCTCGCAGGGAACAAATATTTTAATTTTGCCATCATGTCCTTATCTGATATCCCAGCCAGTATTGCGATGTTTTATGTACTAAAACGATTTAGAAGGAAGAAGCCACTTCTTGTTTCATTTATAACAGCAGGACTCTTATGTTTAACTCAACCCTTCGTGCCGAAAAAATATGTATGGATGTCAACGGGCATGTACTTTCTGGGAAGATTTGTATCAACATTCACTTTCGGTACCGTATATATTTACACTTCGGAATTGTTTCCAACTTATTCGAGAAATTCTATGCACGCTTTGTGTTCAGCTATCGGCCGTATCGGATCCATTTTGGCGCCAATGACACCACTACTCACTCAGTATATGGAAAGTCTTCCGACTTTGTTATTTGGGAGTATATCAATCGCTGCCGGTCTCACAACTTTACTGGTACCTGATTTGGATAATGAACCTCTACCTGATAATGTGCACCAAGCTGAAGCTATAGGTGTTACTCGTCTTAATGTTGAGAAGGTGAAAGAAGACAGTTGA

Protein sequence:

>DPOGS204749-PA
MVALDEKSVHLDEVLKKFGAINRYHVQAIILISFAFFNNGMHCINYVIVAEETPYRCKIEPCEMQSNVFESSWYNFSVPPDSRGCKRYRFDLKECDPFSFNTSIVETCDEWVYKKNDSFVAEFQLACQNWKRTFVGTVHSIGLMCGLFFQGQLSDKIGRKAATVISGLAAGVLGIAKSFSNTYTLYVILEWLEATIGDNCSPAFILAVELVHSDYRLHQQIFLCVIAAMGGVTFAIAAYLVPYWRDFVRVIYAPAFLYIIIYFIMDESVRWLLSKGKRKQATKLLLKMAKINKISLDRKMLLNIQCEEGTKNSALRETFRSRIVAKRFLICLLWWTSCTFICFGLIVNVGSLAGNKYFNFAIMSLSDIPASIAMFYVLKRFRRKKPLLVSFITAGLLCLTQPFVPKKYVWMSTGMYFLGRFVSTFTFGTVYIYTSELFPTYSRNSMHALCSAIGRIGSILAPMTPLLTQYMESLPTLLFGSISIAAGLTTLLVPDLDNEPLPDNVHQAEAIGVTRLNVEKVKEDS-