Monarch geneset OGS2.0

DPOGS201534
Transcript: DPOGS201534-TA, 1461 bp
Protein: DPOGS201534-PA, 486 aa
Genomic position: DPSCF300006, + strand, 1425098-1430578
RNAseq coverage: 12x (Rank: top 83%)
Annotation
Heliconius: HMEL009082 (E-value 1e-119, identity 68.66%)
Bombyx: BGIBMGA002712-TA (E-value 2e-53, identity 50.95%)
Drosophila: w-PA (E-value 4e-49, identity 28.86%)
EBI UniRef50: UniRef50_E2A9Q8 (E-value 4e-65, identity 29.85%), Protein scarlet n=7 Tax=Formicidae RepID=E2A9Q8_CAMFO
NCBI RefSeq: XP_395665.3 (E-value 4e-57, identity 30.29%), PREDICTED: similar to scarlet CG4314-PA [Apis mellifera]
NCBI nr blastp: gi|383866340 (E-value 4e-67, identity 31.01%), PREDICTED: protein brown-like [Megachile rotundata]
NCBI nr blastx: gi|307182736 (E-value 3e-68, identity 30.02%), Protein scarlet [Camponotus floridanus]
Group
Gene Ontology: GO:0016020 (E-value 2e-22), membrane
KEGG pathway: (none listed)
InterPro domain: [210-419] IPR013525 (E-value 2e-22), ABC-2 type transporter
Orthology group: MCL26501, Lepidoptera specific

Nucleotide sequence:

>DPOGS201534-TA
ATGTATTTTAGTGGTGCGGGTAAAACTACATTCTTGGTATCATTGGCTGGTAAATGTAACCTGCCCAACACTGGTACCGTCAACGTATGCGGCAGCAATGTCAAAGATTTGTCACAAGGAGTGGTGGAAGTACTACCACAGTTTGATGTCTTCATGGATAGTCTGAGTGTTAAGTACCATACAAGTGTATCGTTTCAGTTAATCTCCAGCCCGCAAATTCTCATCTGTGATGAACCAACGACGGGTCTCGACAGTTACAACGCCTCCCTTGTTATAGGAGTTCTTAAGAAGTTATCGCTTTATGGCAAAACTGTCATATGCTCCGTACACCAACCGTCCTCAGATTTGTTCAGAGAATTCAACTCTGTTGCTTTAATGTCTGATGGAAAAATGCTGTTCCATGGGACAAGGAATGAAATCAAATCTTTATTTGAAAGACTAAATCTTAAATGTCCTGTCAATTATAATCCGTCAGAATTTTACATAAAAGTTGTCTGTACGGACTCGTTCAAAAATATCTCAGAACTAATGTTGGATAGGACCGAGAATGATGCATACTATGAAGGCAGCGGTACTACACCTATGATATCAGCACATGTCGAAATATGTCAAAGGAATTGGTTCGTCCAAGTCTACCTGTTGTTATGGAGGTCATCACTGACTTTGAAGAAAGGTATCAAAGAATACATCGTCCAAATCCTAATAACTACGGTAATATCTTCGATGATATTGGGGACGTGTTACATTGGCATATCTGGGACGACTCAGCAAGGAGTCCAGGACCTTAGAGGTTTCCTATGGCTGGTTTGTTCGGAAGTATCCTTTAGTGTTTCATACTGTGCTCTGTACGCCTTCCAAAATGATCTCACTCTGATCAAGAGAGAGGCCGGGATTTATAAGGTTTCAGCTTTCTACGTCAGCAGATTACTGAGTACGTTGCCTCGATGCATCATCTGGCCAGTCCTGTATGTGGTGCTGGCGACGCTGGCAGTGGAACTGCCAAATCATTTTCTGACAGCCACGAAATTTGTACTTGTACTTATTGTGACTGCAGTTGCTTCGTCAGCTTATGGTCGAAAAATAGGCGCCCTTTTTACGTCCTCGGGTATGATGGCGGACGTGATGCCATGTGTAGATCTGCCGTTATTTCTCATGTCTGGCGCTTTCCTTAGAATATCTTCACTACCTCATTGGCTGTATCCACTTAAATATATCTCCCATTTCTACTACGGTATGGACGCCCTAAGCAACATTTATTGGAGGCAGATTGATACGATAGAATGTCCTCTGAATTCGACAACAACTTGTTTAAGAAGTGGTGCTGCTGTTTTAATGGAAAATGGATATTCAATTGATTTCGTCTTGCACGACACTTTGGGTATTGTTTTTATAACTCTATTGTGGAGTCTGTTGGGGCTGTTTGGTTTGAAACGAGAAGAAAATAAAGGTTATGCTTATTAA

Protein sequence:

>DPOGS201534-PA
MYFSGAGKTTFLVSLAGKCNLPNTGTVNVCGSNVKDLSQGVVEVLPQFDVFMDSLSVKYHTSVSFQLISSPQILICDEPTTGLDSYNASLVIGVLKKLSLYGKTVICSVHQPSSDLFREFNSVALMSDGKMLFHGTRNEIKSLFERLNLKCPVNYNPSEFYIKVVCTDSFKNISELMLDRTENDAYYEGSGTTPMISAHVEICQRNWFVQVYLLLWRSSLTLKKGIKEYIVQILITTVISSMILGTCYIGISGTTQQGVQDLRGFLWLVCSEVSFSVSYCALYAFQNDLTLIKREAGIYKVSAFYVSRLLSTLPRCIIWPVLYVVLATLAVELPNHFLTATKFVLVLIVTAVASSAYGRKIGALFTSSGMMADVMPCVDLPLFLMSGAFLRISSLPHWLYPLKYISHFYYGMDALSNIYWRQIDTIECPLNSTTTCLRSGAAVLMENGYSIDFVLHDTLGIVFITLLWSLLGLFGLKREENKGYAY-
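
The two sequences above can be cross-checked against the reported lengths (1461 bp, 486 aa). The following is a minimal sketch, not part of the original record: it assumes the standard genetic code applies, that the transcript is a complete coding sequence ending in one stop codon, and that the trailing "-" in the protein sequence marks that stop. The names `translate` and `lengths_consistent` are hypothetical helpers introduced for illustration.

```python
# Standard genetic code, with codons ordered by base T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds, stop_symbol="-"):
    """Translate an in-frame coding sequence, writing stop codons as stop_symbol."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(cds), 3):
        residue = CODON_TABLE[cds[pos:pos + 3]]
        protein.append(stop_symbol if residue == "*" else residue)
    return "".join(protein)


def lengths_consistent(transcript_bp, protein_aa):
    """True if the transcript length equals the protein codons plus one stop codon."""
    return transcript_bp == 3 * (protein_aa + 1)


# First 24 and last 15 nucleotides of DPOGS201534-TA, copied from the record.
print(translate("ATGTATTTTAGTGGTGCGGGTAAA"))  # MYFSGAGK
print(translate("GGTTATGCTTATTAA"))           # GYAY-
print(lengths_consistent(1461, 486))          # True
```

Passing the full DPOGS201534-TA sequence to `translate` should, under the same assumptions, reproduce the DPOGS201534-PA sequence including its final "-".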