Monarch geneset OGS2.0

DPOGS203650
TranscriptDPOGS203650-TA1539 bp
ProteinDPOGS203650-PA512 aa
Genomic positionDPSCF300010 - 2970782-2973655
RNAseq coverage53x (Rank: top 70%)
Annotation
HeliconiusHMEL0030370.071.98% 
BombyxBGIBMGA001008-TA2e-5325.33% 
DrosophilaCG8389-PB7e-8034.22% 
EBI UniRef50UniRef50_Q7QJP11e-8635.21%AGAP007601-PA n=4 Tax=Culicidae RepID=Q7QJP1_ANOGA
NCBI RefSeqXP_308271.42e-8735.21%AGAP007601-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|1582853665e-8635.21%AGAP007601-PA [Anopheles gambiae str. PEST]
NCBI nr blastxgi|1582853662e-8534.85%AGAP007601-PA [Anopheles gambiae str. PEST]
Group
Gene OntologyGO:00550851.4e-22transmembrane transport
GO:00160211.4e-22integral to membrane
KEGG pathway 
InterPro domain[1-505] IPR0161967.7e-50Major facilitator superfamily domain, general substrate transporter
[37-450] IPR0117011.4e-22Major facilitator superfamily
Orthology groupMCL18412 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203650-TA
ATGTCAAAAAGGAATTCTGATGTACAACCAAGGACGGAAAAGATTATCAAGCTCTTACCACCGGATGGAGGGTGGGGTTGGATGGTGCTATTAGGCACTGGACTCTCTAATATTTTTAACCAGTCCATGTTGTCTCTATTCAGCTTGTTATATGGAGATGCTCTAGAAGCTATGGGACATGAGACAAAAGGAGCAGCCATTGTTTTAAGCACTATGTTATTTGTCACAAATTTTGGAGGTCCTATAGCGGGAGCACTTATAAAAATAACGTCACCTAGGTTTGTGGCAGTCCTCGGAGCCTTTTTTTGTACAACCGGTATATTACTTAGTGGATTTTCAACCAACATTTGGCATTTAATTCTCTCTTATGGAGTTTCACTTGGTTTGGGTCTAGGCTTTATTCAAAATGCTTCATTTGTGGCTATTAATGGATACTTTAAGTTAAGGAAAAATTTAGCCGTTGGCCTAGCAATGGTGGGAACTGGAATTGGCCAGACGCTTATGCCGCATGTAGTGCGATACTTTTTAGCCAACTATGGCTTTAAAGATGCTTGTATATTACTGGCATCATTAAGTTTACACGGGATTTGTGGGACGATGCTTATTCAACCCGTTGAGTGGCATATGAAAAAGATTGAGGAAGAGATAGTTATTGATGAGAAGATGTACTTGTTAGACGAAAAAACGGATGCGATAAGCAAAAAAGATTATAGCAGCAATGAAACTTCAAATAAAAATGGTAACCAAACTACTTTGCCTGCGGCAACAGACTTCAATGCGAAAGCTGATTCTAAATCCGTTAGCAGTAAAGAAAAAAAAATGAAACAAAGTCCTGAACGGGCTAAAAATAAAGATGGCAGTATGGAAGGACCGACACAAGAAAAAACACTATTAAGAAAGATTTACGATTTGTTTGATATGTCACTATTATCGAATCCACGTTTTATTAATATTATCATTGGAACTGCACTTACTGTGGTTTCTATACAAAACTTCAGCATGATATTTCCATTCTTTCTGCAGAAAGCTGCGTCAATGAATAAACAACAAACCGCTACTTGTATGTCAGCTGTGGCTTTGGCTGACATCATGGGAAGAATTGTTTTACCAATATTGCAAGATAGATTCCAAATCAAAGCCAGGATGATGCTTATTATGACTAGCATTTGGTTGATAATTGTAAGACAAATTTTGGCTTATCAAAGAGATCTATATTTGTTACTCATCCTCTCCGCCTTGTACGGATTTGGAAGAAGCATGGTAATTGTAGCTAGGAATATAGCTATTTCAGAACCCTGTAGGACGGAACAAGTGCCTTCTGCTGTTGGTTTAGGCATGTTAACAATGGGTATTATTGTACCCCCCGTCGGATATTTCTTAGGCTGGATCAGAGATTATACAGACAGTTTTTTAATATGTATCACAGCCCAAAACATGCTACTTTTATTGTTCCTGGCTATGTGGATTCCTGATATGTTATATTTCTATTTACAAGAAAAGAAGGAGCGAAATAAAAATGTTAGAAAATCAATAACTTAG

Protein sequence:

>DPOGS203650-PA
MSKRNSDVQPRTEKIIKLLPPDGGWGWMVLLGTGLSNIFNQSMLSLFSLLYGDALEAMGHETKGAAIVLSTMLFVTNFGGPIAGALIKITSPRFVAVLGAFFCTTGILLSGFSTNIWHLILSYGVSLGLGLGFIQNASFVAINGYFKLRKNLAVGLAMVGTGIGQTLMPHVVRYFLANYGFKDACILLASLSLHGICGTMLIQPVEWHMKKIEEEIVIDEKMYLLDEKTDAISKKDYSSNETSNKNGNQTTLPAATDFNAKADSKSVSSKEKKMKQSPERAKNKDGSMEGPTQEKTLLRKIYDLFDMSLLSNPRFINIIIGTALTVVSIQNFSMIFPFFLQKAASMNKQQTATCMSAVALADIMGRIVLPILQDRFQIKARMMLIMTSIWLIIVRQILAYQRDLYLLLILSALYGFGRSMVIVARNIAISEPCRTEQVPSAVGLGMLTMGIIVPPVGYFLGWIRDYTDSFLICITAQNMLLLLFLAMWIPDMLYFYLQEKKERNKNVRKSIT-