Monarch geneset OGS2.0

DPOGS216026
Transcript        DPOGS216026-TA   1623 bp
Protein           DPOGS216026-PA   540 aa
Genomic position  DPSCF300067 - 800460-805472
RNAseq coverage   32x (Rank: top 75%)
Annotation
Heliconius      HMEL012229         0.0      78.09%
Bombyx          BGIBMGA008876-TA   2e-145   80.12%
Drosophila      CG10019-PB         6e-102   47.63%
EBI UniRef50    UniRef50_E0VX85    1e-151   52.70%   Monocarboxylate transporter, putative n=3 Tax=Pancrustacea RepID=E0VX85_PEDHC
NCBI RefSeq     XP_975308.2        7e-159   54.31%   PREDICTED: similar to monocarboxylate transporter [Tribolium castaneum]
NCBI nr blastp  gi|189240195       1e-157   54.31%   PREDICTED: similar to monocarboxylate transporter [Tribolium castaneum]
NCBI nr blastx  gi|242020577       8e-156   52.97%   monocarboxylate transporter, putative [Pediculus humanus corporis]
Group
Gene Ontology     GO:0055085   2.6e-11   transmembrane transport
                  GO:0016021   2.6e-11   integral to membrane
KEGG pathway 
InterPro domain   [1-431]    IPR016196   4e-28     Major facilitator superfamily domain, general substrate transporter
                  [24-347]   IPR011701   2.6e-11   Major facilitator superfamily
Orthology group   MCL17322   Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS216026-TA
ATGAAACGACTTTTATCATGTACATATGAGGTCGTGATAGAAAGTAATTGGATCGTTTGCGGCGCCGCTTTCCTTGCTCACTTGCTATCAACCGGACTACAGCTTGCTTATGGCGCATTGCACGTGTATGCGATACGACACCTCGGGCCCGCCGCTGATCATGCCGTGTGGGCTGGCGCACTTTGTGTTGGAGTGTCACGAGCCGCTGGTGCCCTTGTAGCAGGGCGAAGGAAGTCTCCCCGCCTCGCAGCCCTAGTCGGTGGTTTGCTGTTACCCCTCGCCTGTCTCTTTACATCATTCGCCACTCAACTCCATCAAACTCTTTTAAGCTACGGCGTAGTCCTCGGTGTTGGCTGCGGGCTTGTGAGAGAAGCGGCAGGATTAGTTCTTGGAACTTATTTCCGACGGCGAAGACAGTTTGTTGAGCTTGTAGCACACGCCGGGGGTGGAGTCGGCATAGCGTTATTTAGCGTTGCCTATAAAGAAGCTGTTGGAAAGTTGGGATGGCGATTAGGACTTCAGGCAGTAACTGGTGTCCTTGTATTGGCATTTTTCCTAAGCACAGTTTACCGAAGTGCCTCCCTCTACCACCCACAACGACGTGCCATTTTACACCTCAAAAACCAACGTCGGAAGGTTAAGGAAAAGAAAACCCCTAAAATTTCGCCTTTTATCGATTTAAGCCCTCTTCACTCACGATCAGCAAAGGTATTGTTATTAGCTGCTGGGTTGGCAGCTTTTGGGCTTTATACCCCCGTATTTTTTCTTGCCTTGCAAGGATTTCAAGAGGGTTTGGAAGAGAGTGCTTTAGTATTGCTTCAGACTTTTTTGGGATTCGCAGCAGTGCTCGGTTGTGCTGGCTTTGGGCTCGTACTTGTCCGACCGTCAGCACAATGTCTCGTTTCTAAGCAATATTTGTGCCAAACTGCTATGTTAGGTATAGGTATTTCAATGCTCGCTCTGAGTAGTGTTGAAGGCTACCATGGCTATGTATTATTTACTTGGATGTATGGTCTGTGTCTCGGTGGATTTTTGTATTCAATAAAAATGCTTACAATGGAACGGGTTCGGGGACGACATTTTACAAAAGTTTGGAGTTACGTTCAAGGTGCGGAAGCGATACCAGTTATAGTTGGCGTGCCAATGACGGGTTATATTAATCAGCAAGCACCGAGAGCGGGATTTTACTTTTCGACAGCGGCAACACTTGCTGGAGCTCTGTTATTATTTTTTGTTGGTTTCTCAAAACGAGACCCCGAACCCGCACCTGCCCCTGCACCAGCTATTTCAGAGGCGTGTTTGTGTATTACACCTCCACCAAGATGTGAGCCTGCATGGTGTGCTTGTGGCGCTGGGGGTGCGACAGCTTACTGTGCATGGCCTGGGCCCTGTACAACGCGTTTACCCAAGTCTCTTTCATACGCCGCACCGTTGAATCGCAGTTGTTGTTCAATGGAGCGGTATCCTGAGTGTTGTCGTCGTGCTGCTTTGTTGCGTCCATCACGCAGTGTCCCAGAAGGTTTGGCGCGTCGTAGTAGCACAATCAGTAGATCCGCCTCTTGCAAGGCGCCTTGCCACCGCCGCGAACATCATCTCATCGAACAAATTACAACTTCTGTATAA

Protein sequence:

>DPOGS216026-PA
MKRLLSCTYEVVIESNWIVCGAAFLAHLLSTGLQLAYGALHVYAIRHLGPAADHAVWAGALCVGVSRAAGALVAGRRKSPRLAALVGGLLLPLACLFTSFATQLHQTLLSYGVVLGVGCGLVREAAGLVLGTYFRRRRQFVELVAHAGGGVGIALFSVAYKEAVGKLGWRLGLQAVTGVLVLAFFLSTVYRSASLYHPQRRAILHLKNQRRKVKEKKTPKISPFIDLSPLHSRSAKVLLLAAGLAAFGLYTPVFFLALQGFQEGLEESALVLLQTFLGFAAVLGCAGFGLVLVRPSAQCLVSKQYLCQTAMLGIGISMLALSSVEGYHGYVLFTWMYGLCLGGFLYSIKMLTMERVRGRHFTKVWSYVQGAEAIPVIVGVPMTGYINQQAPRAGFYFSTAATLAGALLLFFVGFSKRDPEPAPAPAPAISEACLCITPPPRCEPAWCACGAGGATAYCAWPGPCTTRLPKSLSYAAPLNRSCCSMERYPECCRRAALLRPSRSVPEGLARRSSTISRSASCKAPCHRREHHLIEQITTSV-
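
The transcript and protein lengths reported in this record can be cross-checked against each other. The following is a minimal sketch, not part of the original record. It assumes that the transcript listed here is coding sequence only (no UTRs), that it ends in a stop codon, and that the standard genetic code applies. The function names are hypothetical and chosen for illustration.

```python
# Stop codons of the standard genetic code (assumed to apply here).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def ends_with_stop(cds: str) -> bool:
    """Return True if the last three bases of the sequence form a stop codon."""
    return cds[-3:].upper() in STOP_CODONS


def expected_protein_length(transcript_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by a coding sequence of the given length.

    Assumes the whole transcript is coding sequence (hypothetical simplification).
    """
    if transcript_bp % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    codons = transcript_bp // 3
    # The stop codon occupies one codon but adds no amino acid.
    return codons - 1 if has_stop_codon else codons


# Transcript length as reported in the record: 1623 bp.
print(expected_protein_length(1623))
```

Under these assumptions, 1623 bp corresponds to 541 codons, one of which is the terminal stop, which agrees with the 540 aa reported for DPOGS216026-PA. The trailing "-" in the protein sequence appears to mark that stop position.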