Monarch geneset OGS2.0

DPOGS202013
TranscriptDPOGS202013-TA1227 bp
ProteinDPOGS202013-PA408 aa
Genomic positionDPSCF300053 - 1022940-1024555
RNAseq coverage153x (Rank: top 53%)
Annotation
HeliconiusHMEL0040994e-16370.90% 
BombyxBGIBMGA002444-TA3e-14464.82% 
Drosophilartet-PA9e-0928.40% 
EBI UniRef50UniRef50_D6X4Q15e-6332.42%Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6X4Q1_TRICA
NCBI RefSeqXP_967538.14e-6131.48%PREDICTED: similar to predicted protein [Tribolium castaneum]
NCBI nr blastpgi|2700012292e-6232.42%hypothetical protein TcasGA2_TC016221 [Tribolium castaneum]
NCBI nr blastxgi|2700012292e-6433.08%hypothetical protein TcasGA2_TC016221 [Tribolium castaneum]
Group
Gene OntologyGO:00550853.2e-20transmembrane transport
GO:00160213.2e-20integral to membrane
GO:00058862.7e-10plasma membrane
GO:00052152.7e-10transporter activity
KEGG pathway 
InterPro domain[1-408] IPR0161967.6e-49Major facilitator superfamily domain, general substrate transporter
[10-202] IPR0117013.2e-20Major facilitator superfamily
[14-30] IPR0019582.7e-10Tetracycline resistance protein, TetA/multidrug resistance protein MdtG
Orthology groupMCL19689 Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202013-TA
ATGTCTTTCGAAATATCCCTTTTGCAGTTTGTTGCTTTTTTGGATTTACTTGCTGTTGGTCTGATAGTACCATTAATACCTAATCATGTTAAACAAATGGGTGGTAGCCAACTATACATTGGCCTTCTTGGCTCCATTTATGCTGGTTTCCAACTGGGAGCAGGCCCACTTATTGGAAGTTTAAGTGATATAAGAGGACGGAAACCAATACTACTCCTAACACTACTAGTATGTAGTGTTGCCTATATGACGTTGGGTTTAACGTCATCTATTGCTATTATATTTGTATTGCGGGCTGTTTTAGGTCTGTTTAAACAAACACAATTATTAACTAAAGCTTTGGTACCTGATTATGAAAAAGTGGAAAGCAAACAGTCCCAAATATATGGTAAAATGGCAGCAATATCAGGAGTTGGAATAACATTGGGACCAATTTTTGGTGGACACATTGTTGAAGACTACCCTCTTTATGGATTTACTATTATCGGTGCAATTGTTAGCACCTTCTTTTTAATAAATGCAGGATTGATATACATGCTGCCAAAAACTGATTGTATAATTAAAACTAAAGCCCAAACAAAAGATGTATCACAGAATTTCTTCCAATCGTTACTTTCTGGCATTAAACAGACTTTTGTTGAGTTATCCAAAGTGGAATGGCTAAAATATTGGAAAATTTTTATGTTTCAAGCCTTAAACAGTTTTGCTATGGGTGTTTATTACTCTAGTTATGCATTATATTTAAAAACACAGTTTGGTTTAACTCCAAAGAATATTGGATATGTGGTGTCCATACAAGGAGTTATTGGTGCAATATCTTCCTTTTTCATGGGTCGTATAAACAGTTTATATGTCCACGACAAAGACTATAGTATACGAAATAATCATGTATTTTTATTATTAAGTATGTCCTTATTAGGAATATGTTTGGCATTCAATATTTATATGTATTCAATTTTGTTAATTCCATTAGCGGTAGGTAATGCAGTTGGAAGACTGGTGACTCTTGAAATGGTGCTAAAAAGAAGTAAAGGGGATCATAGGGGAACTTTAATAGGAGCTTCCAATAGTGTAAGGTCATTAACTGGAGTCATAGCACCAATGGTGGCTGGTTTTATTGGGGAATTTTTTGGCATTTCATATGTTTTATATGCATCTTTCTTTTCAACATTTGTTGGGCTTATATTCAGTTATCAGTTTAGAAGTAAACAGAGTAAAGTGGATTAG

Protein sequence:

>DPOGS202013-PA
MSFEISLLQFVAFLDLLAVGLIVPLIPNHVKQMGGSQLYIGLLGSIYAGFQLGAGPLIGSLSDIRGRKPILLLTLLVCSVAYMTLGLTSSIAIIFVLRAVLGLFKQTQLLTKALVPDYEKVESKQSQIYGKMAAISGVGITLGPIFGGHIVEDYPLYGFTIIGAIVSTFFLINAGLIYMLPKTDCIIKTKAQTKDVSQNFFQSLLSGIKQTFVELSKVEWLKYWKIFMFQALNSFAMGVYYSSYALYLKTQFGLTPKNIGYVVSIQGVIGAISSFFMGRINSLYVHDKDYSIRNNHVFLLLSMSLLGICLAFNIYMYSILLIPLAVGNAVGRLVTLEMVLKRSKGDHRGTLIGASNSVRSLTGVIAPMVAGFIGEFFGISYVLYASFFSTFVGLIFSYQFRSKQSKVD-