Monarch geneset OGS2.0

DPOGS201431
TranscriptDPOGS201431-TA1779 bp
ProteinDPOGS201431-PA592 aa
Genomic positionDPSCF300006 - 1283165-1287628
RNAseq coverage272x (Rank: top 40%)
Annotation
HeliconiusHMEL0090910.093.20% 
BombyxBGIBMGA002586-TA0.089.66% 
DrosophilaCG31663-PA1e-15869.01% 
EBI UniRef50UniRef50_Q17A700.068.88%Putative uncharacterized protein n=6 Tax=Culicidae RepID=Q17A70_AEDAE
NCBI RefSeqXP_319201.40.065.26%AGAP010048-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|1582990820.065.26%AGAP010048-PA [Anopheles gambiae str. PEST]
NCBI nr blastxgi|1582990820.065.70%AGAP010048-PA [Anopheles gambiae str. PEST]
Group
KEGG pathway 
InterPro domain[19-592] IPR0161962.7e-38Major facilitator superfamily domain, general substrate transporter
Orthology groupMCL17737 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201431-TA
ATGGCTAAATCATTAGACGACATTTCAAAAAAATATAACGAAGACAACACATCCATGGAGGAAGACACTGAGGACTCTCCAGGATGTTGGCAGCGGATGATCGTTTTCTTTCGGATCAATCCAAATCTGATTATGTTAAAAGTGACTTTATTTGTGATGTATGGAGCCACAGCATCTCTTTTGCCGTATTTGACGATTCATATGCAAAGTATAGGGCTCACTGTACCAGAAATTTCTTTTGTGTATTTAGCATTGCCATTCACAACATTTTTAAGTCCTCCTATTACAGGATTTCTTGTGGATCGTTTTGGAGAGTACAAACCGGTGGTAATCACAGCTCTGATCCTAAACGCAGCATTCCATCATTCACTCCTTCTCATCCCTCATCAGGAAACGCCAGGAGTAATGCCATCTGCTTACGTCATGCGACATCCGATAAAGCGGAGCGTTGAGATATGGTGGAGCCCTTGTCCCAGTAGAGAGTGTCCGGATGAGGACGAACAAGTGGACTTTGTCCTTGACCGCTGTGACGATCATTGTATGCTGTCCGGACCCACTAAGAGTCCCACCAACGACCCCAGCGATGACCAGGACGATATTGACTTTCATATATCACACAAAATTAATCCACCAAATCTCTCCAACAGCAGTTTTTTAAATATAACTGATGACGAGGAGGTCTATAATGACGCCGCATTTTTCTTGATAGAAGTACACGATGATTTGGGTGAACCTAAAGAACAACTTGGCATTGAAATGGAAAGGGACGACAATGATACAGTTACAGATTTCAGAACAAGGTTCGGTGAGAAACTTTTGATTGCCCACGGTGTAAATATTACAGCGCTAGATAAGGAGGATTTAAGATGCGGTGGTCTAGTTATGTTGCACAATATGACAAAACTGTCACAGCAGCGACTGGAGACTTTGTCGGCAGACTGCATGGTTCAGAAATGTGATTTTAGAAAAGGAGGTCCAGAAATATGTCCCCCCGATTACAAGGAAAGCGATGATAAAACATTTTACATTTATTTCTTTTTACGCTTCATGGGCACTATAATGTTATCCGCAGGCGTAACTATTATGGATCCCATAGCTCTTACCATGATACAAAAATACGGTGGTGACTTCGGAAGAGAAAGGCTCTTCTCTAGTATCGGAATGGCAATATTTTCACCTATCACAGGCATGCTCATAGATATGAGGAGCAAACAAGTCGGATACACAGATTACTCGGCCGCTTTCTATACATACGATGTCTTGCTCCTGATATCGGCCATAACCGTGGCCGTGATGCCTCTAGGAGCTAAACTTCCAGCGGACAATCTCTTACACGACCTCGTGAACATTATAAAGATGCCACATATTATAATATTTATATTCTTCTTATTCTTACTAGGAAATTTTTGGGGTTTCATCGAATCATATTTGTTCTTGTATTTGAAGGAACTAGGAGCTCCGAACTTTTTGCTCGGTATTACAGTAACAGTGGGAACCCTCAGCAGCATTCCGTTCCTGTACGGTGCTGAGGCGATCACATCTCGTATAGGACACGTCAACGTCATCATCATCGCATTCTTCTCGCACGCCGCTCGCCTCGTCGGATATTCATTCATCGAGGATTCCTGGTGGTGTTTCCCCTTCGAGGCCATGGAATCACTGTCCGTACATCTAATGTGGGTGGCCGCGGCGACATACTGTGCAATGTTGGCACCCAAGAGTTTACTGGCTACTCTCATCGGAGTATTGGGAATGGCCCATTTCAGTCTTGGTCAGTAA

Protein sequence:

>DPOGS201431-PA
MAKSLDDISKKYNEDNTSMEEDTEDSPGCWQRMIVFFRINPNLIMLKVTLFVMYGATASLLPYLTIHMQSIGLTVPEISFVYLALPFTTFLSPPITGFLVDRFGEYKPVVITALILNAAFHHSLLLIPHQETPGVMPSAYVMRHPIKRSVEIWWSPCPSRECPDEDEQVDFVLDRCDDHCMLSGPTKSPTNDPSDDQDDIDFHISHKINPPNLSNSSFLNITDDEEVYNDAAFFLIEVHDDLGEPKEQLGIEMERDDNDTVTDFRTRFGEKLLIAHGVNITALDKEDLRCGGLVMLHNMTKLSQQRLETLSADCMVQKCDFRKGGPEICPPDYKESDDKTFYIYFFLRFMGTIMLSAGVTIMDPIALTMIQKYGGDFGRERLFSSIGMAIFSPITGMLIDMRSKQVGYTDYSAAFYTYDVLLLISAITVAVMPLGAKLPADNLLHDLVNIIKMPHIIIFIFFLFLLGNFWGFIESYLFLYLKELGAPNFLLGITVTVGTLSSIPFLYGAEAITSRIGHVNVIIIAFFSHAARLVGYSFIEDSWWCFPFEAMESLSVHLMWVAAATYCAMLAPKSLLATLIGVLGMAHFSLGQ-