Monarch geneset OGS2.0

DPOGS204917
Transcript: DPOGS204917-TA (1245 bp)
Protein: DPOGS204917-PA (414 aa)
Genomic position: DPSCF300340 + 41182-56521
RNAseq coverage: 41x (Rank: top 72%)
Annotation
Heliconius: HMEL010825; 3e-74; 92.05%
Bombyx: BGIBMGA001691-TA; 9e-63; 73.56%
Drosophila: Sln-PA; 3e-61; 48.65%
EBI UniRef50: UniRef50_B0WBF9; 9e-67; 57.08%; Monocarboxylate transporter n=4 Tax=Culicidae RepID=B0WBF9_CULQU
NCBI RefSeq: XP_001846043.1; 2e-67; 57.08%; monocarboxylate transporter [Culex quinquefasciatus]
NCBI nr blastp: gi|322787096; 6e-68; 50.00%; hypothetical protein SINV_05876 [Solenopsis invicta]
NCBI nr blastx: gi|322787096; 6e-67; 50.00%; hypothetical protein SINV_05876 [Solenopsis invicta]
Group
Gene Ontology: GO:0055085; 5.7e-09; transmembrane transport
Gene Ontology: GO:0016021; 5.7e-09; integral to membrane
KEGG pathway 
InterPro domain: [1-214] IPR016196; 1.8e-19; Major facilitator superfamily domain, general substrate transporter
InterPro domain: [54-213] IPR011701; 5.7e-09; Major facilitator superfamily
Orthology group: MCL15472; Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204917-TA
ATGTCCCCGACTGATCAGGAAGAAGACTCCTTGTTGACGTTGTCGTCGGTGACAGCGCGCCCTCTGTTGGCAGCTTCGAGGAAGTTGAGACCACACTCAGATTCCGCTGATCTTTCCCCCCCGGACGGCGGGTACGGCTGGGTGGTGGTGTTCGGTGCCTTCATGGTTCAGTTCTGGGTGGCCGGCCTCTTCAAGTCATATGGAGTTCTGTACGTTGAAATTATGGAGACATTCCCGGACTCATCGGAGAGCGTCGCTTCATGGATACCAGCGGCACTGTCGACTTTGTGCTTGGCGCTTGCGCCGCTCTCATCAGCTCTGTGTGAGAAGTATTCGTGTCGTCTGGTGGTGTTCGCTGGAGGTCTCTGCTGCGGCGCGGGCCTGGCGCTCTCGTATTTCAGCAACGGTCTGCTGCATTTACTGCTCAGCTTCGGAGTGCTGACGGACGTCTCTATATATTATTATATATATATGTACCTATATTACAAAATTATTACTTACTATGTTAAGGAGTTCTTGGCGTTGGCTAACGGTATATGTGTGAGCGGGACGGCTGTAGGCAGCTTCGTTTTTCCAATGCTCATAGAGAAATTAGTAGGAACTTACGGATTCCACGGCACTGTACTGTTATTGGGTCTGCCCCGGGATGTTGTGAAGCTTCGTCTCAGACCCACTCGATCATCATCTATCTTACACTCTGTTGAAGACCTGTCCACGGACAGCACATCTCTTTACATTAAGCCACCTCTACCGAACAGAATAATACAAAGCAAGCTCTCCGAACCGGTGAGCATTAAGAAGGAAGAAGAAGAGCTAAAAGAAGAGACCGTAGAAGTTAAAGAAGAGAAGAGAGGATGCGTCGTGACGATATCTAGTCTTTCCCCCCCGGACGGCGGGTACGGCTGGGTGGTGGTGTTCGGTGCCTTCATGGTTCAGTTCTGGGTGGCCGGCCTCTTCAAGTCATATGGAGTTCTGTACGTTGAAATTATGGAGACATTCCCGGACTCATCGGAGAGCGTCGCTTCATGGATACCAGCGGCACTGTCGACTTTGTGCTTGGCGCTTGCGCCGCTCTCATCAGCTCTGTGTGAGAAGTATTCGTGTCGTCTGGTGGTGTTCGCTGGAGGTCTCTGCTGCGGCGCGGGCCTGGCGCTCTCGTATTTCAGCAACGGTCTGCTGCATTTACTGCTCAGCTTCGGAGTGCTGACGGGTGAGGATTCCGACGGATCTGGGATTATGGATTAA

Protein sequence:

>DPOGS204917-PA
MSPTDQEEDSLLTLSSVTARPLLAASRKLRPHSDSADLSPPDGGYGWVVVFGAFMVQFWVAGLFKSYGVLYVEIMETFPDSSESVASWIPAALSTLCLALAPLSSALCEKYSCRLVVFAGGLCCGAGLALSYFSNGLLHLLLSFGVLTDVSIYYYIYMYLYYKIITYYVKEFLALANGICVSGTAVGSFVFPMLIEKLVGTYGFHGTVLLLGLPRDVVKLRLRPTRSSSILHSVEDLSTDSTSLYIKPPLPNRIIQSKLSEPVSIKKEEEELKEETVEVKEEKRGCVVTISSLSPPDGGYGWVVVFGAFMVQFWVAGLFKSYGVLYVEIMETFPDSSESVASWIPAALSTLCLALAPLSSALCEKYSCRLVVFAGGLCCGAGLALSYFSNGLLHLLLSFGVLTGEDSDGSGIMD-