Monarch geneset OGS2.0

DPOGS206658
TranscriptDPOGS206658-TA1239 bp
ProteinDPOGS206658-PA412 aa
Genomic positionDPSCF300048 + 107642-109842
RNAseq coverage327x (Rank: top 35%)
Annotation
HeliconiusHMEL0064090.090.36% 
BombyxBGIBMGA008491-TA4e-17885.79% 
Drosophilafend-PB2e-1532.42% 
EBI UniRef50UniRef50_D2A2X57e-10958.20%Putative uncharacterized protein GLEAN_07896 n=2 Tax=Tribolium castaneum RepID=D2A2X5_TRICA
NCBI RefSeqXP_001812942.19e-11958.85%PREDICTED: similar to ld14 CG12664-PB [Tribolium castaneum]
NCBI nr blastpgi|1892362292e-11758.85%PREDICTED: similar to ld14 CG12664-PB [Tribolium castaneum]
NCBI nr blastxgi|1892362291e-11558.49%PREDICTED: similar to ld14 CG12664-PB [Tribolium castaneum]
Group
KEGG pathway 
Orthology groupMCL20389 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206658-TA
ATGGAGCTAAAGGCTAAGTTTAGTAGCATTTTGCGGTCGGGACCGCTTCTAGCTGCGAAAATTATATCTCAGACGAGACTGTTACTCAAGTTTATTGCCTTCTGGTTTGCAACACCTCGACCTAGTGGGGAGTCATCGTGTTTGCAAGGCCCTGACTGTTTTATGTGTTGGGAAAATTGCGAGCTTCTACAATCTAACTATCAAGTCTGGGGCGCTGTTTGTGATGAGAAAGATATTTGTTTTCCGGGATGCAAAGTTGCTTGTGAATTCCACACGGAAGCTGCCAGATCTTCCCAAACCCAACCTGTGATTCATACAAAAGGCGAAGGCGTCATGAGACTCAGCGGGGGCTTGGCCCGATGGCCACCACCGGCACCGCGCGCTGTCTCTCAGCCGACTCCTTTAGTTTATGTAGTAATGCGACGAGCAGCAGAAGGGCCTTGGCGTCAAATTATACAAACCCAAGCTTTAGCTACAAGGGTACCATCAAACAATGAAGGTGCTCTTCTGCGCGTACTCGTTGTTGATCCTCAAGGTCTTGTCACAATATACAGCCCGGATGAAACATGGCTAGCTCATGATACCAACACATCAAATCACGATGAACATAAATGGACGCTGAAAGAAATATCTCTTATACATCAAAAGGTACTTGTCATTGGAGAAATTGCGTGGGAACCTAGAATCGCACGTGGAGTATACCTTGTAACTTGGGAGGTTGATGGTGGAGGTTTGAAAGGAAATCTATTTACTGATTCCACACGCGTTACTTTATCCTTATGGCCAGATACAATTTACCATATTCAAGTTGAACTTGTGTCTAGAACACTGGGAGTAGATAATGAGAAGTCCGAAACTATGACAATTGATACAAGTCGTGCACAGCGTGTTTCTACGGAAAGCGTTCAGGATGTTGAAGTAAGTGAAGGCGATCGTCTTATGTCAGTACTAGTTAGTTCAATGAGAGGCGAACGCGTGCCGGAACGTGCCCCCGATTCTGAATTAGTATTAGGATGTTTGTCAGCTATGATAGCTTTTGGATTAGCGGTGCTTGTTGCGATCGTTTGGAGGCGAAGACGACGGGGTACTTTTCCTGGAACTAATGTTTATGTTGGTTCTTCTTCTGTTCTTAAACGAAAGTTAGTTGAAGATTTTGTTCCAGCGCAACATTGCTTTCAGCCTGTCCTTGCTCCTACAACTCCATGCAATGGGACAGCACCAAGGGATGTAGTTGTGTGA

Protein sequence:

>DPOGS206658-PA
MELKAKFSSILRSGPLLAAKIISQTRLLLKFIAFWFATPRPSGESSCLQGPDCFMCWENCELLQSNYQVWGAVCDEKDICFPGCKVACEFHTEAARSSQTQPVIHTKGEGVMRLSGGLARWPPPAPRAVSQPTPLVYVVMRRAAEGPWRQIIQTQALATRVPSNNEGALLRVLVVDPQGLVTIYSPDETWLAHDTNTSNHDEHKWTLKEISLIHQKVLVIGEIAWEPRIARGVYLVTWEVDGGGLKGNLFTDSTRVTLSLWPDTIYHIQVELVSRTLGVDNEKSETMTIDTSRAQRVSTESVQDVEVSEGDRLMSVLVSSMRGERVPERAPDSELVLGCLSAMIAFGLAVLVAIVWRRRRRGTFPGTNVYVGSSSVLKRKLVEDFVPAQHCFQPVLAPTTPCNGTAPRDVVV-