Monarch geneset OGS2.0

DPOGS207941
Transcript        DPOGS207941-TA  1395 bp
Protein           DPOGS207941-PA  464 aa
Genomic position  DPSCF300090 - 305495-308914
RNAseq coverage   381x (Rank: top 31%)
Annotation
Heliconius      HMEL005186        0.0     85.32%
Bombyx          BGIBMGA000383-TA  1e-148  63.14%
Drosophila      CG10616-PA        1e-37   25.21%
EBI UniRef50    UniRef50_D6WJD0   3e-79   37.00%  Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WJD0_TRICA
NCBI RefSeq     XP_968062.1       5e-80   37.00%  PREDICTED: similar to odorant response protein ODR-4 [Tribolium castaneum]
NCBI nr blastp  gi|91084077       1e-78   37.00%  PREDICTED: similar to odorant response protein ODR-4 [Tribolium castaneum]
NCBI nr blastx  gi|322794549      1e-81   38.92%  hypothetical protein SINV_00989 [Solenopsis invicta]
Group
KEGG pathway 
Orthology group  MCL12121 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207941-TA
ATGGTTAGAACGGTTTATTCGCCAGAATATTGTCTTCAATATTTAGAACAATATGCTTCGCAGGAAACATTTGTGTTAGGGTTAATTTTAGGCCAGATATCAAATGCACGTGAAAATGTTATCCATCTGGCTAGAACTCCTGAAGAAAAACCAGCAGAGTCATCATCAGAAGCTAGCTTCACATCGGAAAAAACAGAGTTAGAGCATAATTTATTAGGAGTTTCGGAAGCGTGGGTAGCGGATCACGCCAGACATGTTACTCGAATGCTTCCTGGAGGCATTTTTGTACAAGGAATTTTTATAACAAGCGATGAGGACATCTTTGAAGACCCAAACTATTTTAGCAAAGTTAGAGCTGTGCTTAATTATGTTTACAAAGCATTAGGCATGAATCAGTTTATGTATGGAAATTATACAAATTTAAATGAAAAGCTCATACTGCACATGTCCACAAGCACAAAGGTCCTTACATGTAAGAGTATTGAAGTAGGCATTGGTAAGAATTCAGCAATAAAACCGGTGGATTGGAAGTTTTTACCAAAACATCAGCAATGGCAGCGTTTAGATTGTTATTATGAATTTGATGAAGTTTATCCTGTGATAGTAAAGAAAAGTGGAATATCTGTGAAGCAACAATTTCAGCAAATTTTAGAGTCTGCTCACAAAACTATTTCATCCAGTGTAATGTTTATTGATGGTGAACTGAAAGACGGCTCTGAGGCACTGGAGCAACTCGTTAAAAAGAAAAGACCCAAAAGTAGTGCAAAATGTGCCCAAGATGCCCCTAAATCAATGCATGTGTCTTTGTTTGTCCCATTTGAAAATAGTTTGCCAGAAACCGTGGAGTATTTGGAATGTGATGGCAGCATTCATTTCAGTGGTGTTGTGTCATCTAGTGTGTTCATGTACCCTAAGGCCACAGTTAGCGAAGCTATTTTAGCTGTAAAGCAGGACATAATAAGATCATTAGCTTCCCGATTCACTATGCACTGTGATGCTTTGATTGATGATAATTTATTACCCGAGGAAAAAGTATGTTTCAATGAGCCGCCAAGACGAGTGTTAGTTCCGGTTGGGTCTCTTTATTTTTGTGACTACCTCTTCCCCGGGGAGGCTCCTGCTGAGGCTCTGTTGTCTGTCAAAGAGCTATTAGACTTACAAATAACAGAGAGTGAGGTTATTTGTGATGTGGAAACTCCCGCAGATACGTCTGAATTTGATGCCCTTGATAGGGATACAAGCAGCGAAGAACTCTTAGCCACTCCTCAAGAAGCGAGCCAGTTCATGTATATAACCGGCATATGTTTTGCCATGCTTGTTCTGTTTGTGTCAATTATAATACATTATTATGATGCCATTGTTCACTTTATTAGTAACTTAGTGTCTAGATCTTAG

Protein sequence:

>DPOGS207941-PA
MVRTVYSPEYCLQYLEQYASQETFVLGLILGQISNARENVIHLARTPEEKPAESSSEASFTSEKTELEHNLLGVSEAWVADHARHVTRMLPGGIFVQGIFITSDEDIFEDPNYFSKVRAVLNYVYKALGMNQFMYGNYTNLNEKLILHMSTSTKVLTCKSIEVGIGKNSAIKPVDWKFLPKHQQWQRLDCYYEFDEVYPVIVKKSGISVKQQFQQILESAHKTISSSVMFIDGELKDGSEALEQLVKKKRPKSSAKCAQDAPKSMHVSLFVPFENSLPETVEYLECDGSIHFSGVVSSSVFMYPKATVSEAILAVKQDIIRSLASRFTMHCDALIDDNLLPEEKVCFNEPPRRVLVPVGSLYFCDYLFPGEAPAEALLSVKELLDLQITESEVICDVETPADTSEFDALDRDTSSEELLATPQEASQFMYITGICFAMLVLFVSIIIHYYDAIVHFISNLVSRS-
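
The transcript and protein records above can be cross-checked against each other: 1395 bp is 465 codons, that is 464 amino acids plus one stop codon. The following is a minimal sketch of that check, assuming the standard genetic code applies to this nuclear gene. The function names `translate` and `expected_protein_length` are illustrative and not part of the database, and the stop codon is written as "-" only to match the convention used in the protein record above.

```python
# Standard genetic code, built from the conventional TCAG codon ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def translate(cds: str, stop_symbol: str = "-") -> str:
    """Translate a coding sequence in frame 0, codon by codon.

    Trailing bases that do not fill a complete codon are ignored.
    Stop codons are emitted as `stop_symbol`.
    """
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    residues = []
    for i in range(0, usable, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


def expected_protein_length(transcript_bp: int) -> int:
    """Amino acid count for a CDS that ends in a single stop codon."""
    return transcript_bp // 3 - 1


# First 30 and last 21 bases of DPOGS207941-TA, copied from the record above.
first_codons = "ATGGTTAGAACGGTTTATTCGCCAGAATAT"
last_codons = "AACTTAGTGTCTAGATCTTAG"

print(translate(first_codons))        # MVRTVYSPEY
print(translate(last_codons))         # NLVSRS-
print(expected_protein_length(1395))  # 464
```

Passing the full nucleotide sequence to `translate` in the same way gives a string that can be compared directly with the DPOGS207941-PA entry.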