Monarch geneset OGS2.0

DPOGS208580
Transcript: DPOGS208580-TA, 1572 bp
Protein: DPOGS208580-PA, 523 aa
Genomic position: DPSCF300064 + 1793120-1801849
RNAseq coverage: 159x (Rank: top 52%)
Annotation
Heliconius: HMEL004934; 2e-151; 87.75%
Bombyx: BGIBMGA010592-TA; 0.0; 71.93%
Drosophila: CG4324-PA; 6e-124; 45.89%
EBI UniRef50: UniRef50_Q8SYM5; 9e-122; 45.89%; CG4324 n=16 Tax=Endopterygota RepID=Q8SYM5_DROME
NCBI RefSeq: XP_001811945.1; 0.0; 63.34%; PREDICTED: similar to SVOP protein [Tribolium castaneum]
NCBI nr blastp: gi|189237649; 0.0; 63.34%; PREDICTED: similar to SVOP protein [Tribolium castaneum]
NCBI nr blastx: gi|189237649; 0.0; 62.98%; PREDICTED: similar to SVOP protein [Tribolium castaneum]
Group
Gene Ontology: GO:0055085; 1.7e-11; transmembrane transport
               GO:0016021; 1.7e-11; integral to membrane
KEGG pathway:
InterPro domain: [24-517] IPR016196; 5.3e-41; Major facilitator superfamily domain, general substrate transporter
                 [70-171] IPR011701; 1.7e-11; Major facilitator superfamily
Orthology group: MCL14928; Single-copy universal gene

Nucleotide sequence:

>DPOGS208580-TA
ATGCGTAGATCTAAAGGATATCAAGATCTTGATGAAAATGTTGGTGATGGACGTCAACCGGCGCCTCCGATGCCGCCTCCTCAGGAAATCGAAATGGCATCGGTTTCCGTCGTTCCCGACGATACATTCACAGTCACTCAGGCGGTGAATGCCCTAGGGTTTGGTTGGTTTCAAGTGAAGCTGTCCTTGTATACGGGCCTCTGTTGGATGGCAGATTCTATGGAAATGACTATATTGAGTATTTTATCACCCGCCCTGCATTGCGAATGGAATATTAGCAAGTACCAACAAGCCCTTACAACAACAGTTGTTTTCATGGGAATGATGCTGAGTTCAACATTCTGGGGAAATATAAGTGATCGTTATGGCAGAAAAACTGCGCTAAGTATGTGTGGCGTTCTGTTGTTTTATTATGGACTCTTAAGTGCAATAGCTCCAAATTTCCTATGGTTGTTATTTTTGAGGGGTCTAGTCGGTTTTGCTATAGGATGTTGTTTCTGGGCCCTCGGAGCTTGTTTGGAAGTAGCTTTGGCACTGGTTGTGATGCCAACATTAGGTGTGCATTGGCTTCTGGCATTATCTACTGTACCGTTGCTTATATTTGCACTTATATGTCCGTGGCTTCCTGAATCTGCCAGATTCCATGTAGCCAGTGGTCAACCTGACAAAGCTTTAGCAACACTCGATCAGATAGCCCGGGATAACGGCCGTCCTATGTTGCTGGGTAGACTTGTCTGCGATGACTCCGTGGGTATCGGACAAAGAGGACGTCTTAAGCATCTTTTAATACCACAGCTTAGAAACACCAGTCTGCTACTTTGGGTTATTTGGATGTCCTGTGCGTTCTGCTACTACGGTCTAGTGCTGATGACAACGGAACTGTTCGAGACTGATGCTGGCGAGGAGCCCTGTGCCGCTGACTGCCGTCCGTTACAGACCACAGACTATATGGACCTGTTGTGGACCACACTAGCAGAATTCCCCGGAATATTTGCAACAATATTTATAATCGAGAAGTTTGGACGCAAGAAGACGATGGCATCTCAATTCGTTATATTCGCCATGTGCGTCTGCGTATTAACATACAACGCGAACCGTACCTTCCTCACTTGCACATTGTTCCTGGCTCGCGGAATAATCGCTGGTTTGTTCCAAGCTGCCTACGTTTACACGCCGGAGACATACCTCCGGATGGAATTGCACTCCATACAACCTCATCTGCTCATTGTAAATGGCCGCTATGAGATGGTTCGACGACTGCGCCCCCACCCGGAACCGTTCGCCGACCTTCTGGACGCTATCCCCGTGGGTCAGCAGTACTGGCTGCAGCTTTTCCAGGCGGCTGGATATAACATATCGTTGAGGTCAACGGCCGTAGGTGCTTGTAGTGGTGTTGCCAGACTAGGGGCCATGGTTACACCATATGTAGCACAGGTACTGTTAAGGAACTCAGTGTTAATAGCCACCGCTGTATACTCTGTGGCTGCATTGTTAGCAGCTGCTGCCTGCCTCGCTCTGCCGATAGAGACTAAAGGCAGAGAAATGAAGGATACTGTCACTGTCACCGGCTGA

Protein sequence:

>DPOGS208580-PA
MRRSKGYQDLDENVGDGRQPAPPMPPPQEIEMASVSVVPDDTFTVTQAVNALGFGWFQVKLSLYTGLCWMADSMEMTILSILSPALHCEWNISKYQQALTTTVVFMGMMLSSTFWGNISDRYGRKTALSMCGVLLFYYGLLSAIAPNFLWLLFLRGLVGFAIGCCFWALGACLEVALALVVMPTLGVHWLLALSTVPLLIFALICPWLPESARFHVASGQPDKALATLDQIARDNGRPMLLGRLVCDDSVGIGQRGRLKHLLIPQLRNTSLLLWVIWMSCAFCYYGLVLMTTELFETDAGEEPCAADCRPLQTTDYMDLLWTTLAEFPGIFATIFIIEKFGRKKTMASQFVIFAMCVCVLTYNANRTFLTCTLFLARGIIAGLFQAAYVYTPETYLRMELHSIQPHLLIVNGRYEMVRRLRPHPEPFADLLDAIPVGQQYWLQLFQAAGYNISLRSTAVGACSGVARLGAMVTPYVAQVLLRNSVLIATAVYSVAALLAAAACLALPIETKGREMKDTVTVTG-
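
The transcript and protein lengths listed in this record can be cross-checked against each other: a complete coding sequence should contain three bases per residue, plus one stop codon. The following is a minimal Python sketch of that check, together with a small FASTA reader for records laid out like the two above. The function names `parse_fasta` and `cds_length_matches` are hypothetical helpers introduced here for illustration; they are not part of any database tooling, and the sketch assumes the transcript entry is the coding sequence only (no UTRs).

```python
def parse_fasta(text):
    """Return {record_id: sequence} from FASTA-formatted text.

    The record id is the first whitespace-delimited token after '>'.
    """
    records = {}
    current = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue  # skip blank lines between records
        if line.startswith(">"):
            current = line[1:].split()[0]
            records[current] = []
        elif current is not None:
            records[current].append(line)
    return {name: "".join(parts) for name, parts in records.items()}


def cds_length_matches(nt_len, aa_len, has_stop=True):
    """True if a CDS of nt_len bases encodes aa_len residues.

    With has_stop=True, one extra codon is counted for the stop codon.
    """
    codons = aa_len + (1 if has_stop else 0)
    return nt_len == 3 * codons


# Lengths as listed in the record: 1572 bp transcript, 523 aa protein.
# 3 * (523 + 1) = 1572, so the two figures are consistent.
print(cds_length_matches(1572, 523))
```

To run the same check on the sequences themselves, pass the FASTA text to `parse_fasta` and compare `len()` of the nucleotide record with the length of the protein record, not counting any trailing stop marker such as `-`.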