Monarch geneset OGS2.0

DPOGS210078
TranscriptDPOGS210078-TA1770 bp
ProteinDPOGS210078-PA589 aa
Genomic positionDPSCF300017 - 127767-133303
RNAseq coverage106x (Rank: top 60%)
Annotation
HeliconiusHMEL0023340.070.81% 
BombyxBGIBMGA012744-TA0.067.38% 
DrosophilaCG6652-PA5e-3636.92% 
EBI UniRef50UniRef50_UPI00022463928e-5052.41%UPI0002246392 related cluster n=1 Tax=unknown RepID=UPI0002246392
NCBI RefSeqXP_001605681.12e-5052.41%PREDICTED: similar to conserved hypothetical protein [Nasonia vitripennis]
NCBI nr blastpgi|3454803273e-4952.41%PREDICTED: hypothetical protein LOC100122077 [Nasonia vitripennis]
NCBI nr blastxgi|3838629836e-5631.21%PREDICTED: uncharacterized protein LOC100881552 [Megachile rotundata]
Group
KEGG pathway 
Orthology groupMCL26005 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210078-TA
ATGTCAGTTCTCAGTCTCGTGCCAGAAGAGAAACGAAAAGATCAAAGATCGTCTTTAGAGAGTGTATACAGCAGTAACTCCAGACTGAATTTGTTAAACAAACGGAAGCGCTATACTCCGGACATGTCAACATCGTCTAAGGGTGATAAATATGTGACACAGAGAGTGCTCTCAGCTAAGACGCATCGTGTTAAACAACTGCAAAATCAGCTGGCGGACGCACACTATCATTTACAGGAGCTTAGTAACGAGAACAAAGTACTCCGGGCCTTGCAAAAGAAACAAGAAATCGCTCTGCAGAGATACGAGAATTCAAACGCAGAACTGCCCCAGGTGTTGAGTTCTCACAGCGAAGAGATGCGAATCCAGCAATGCAAGTACAAGCAGCTGAAGACACAGTACAAAGATATCACACAGAAGTTGAAGGAGAGGGAAATGCAGTTGCAGCAGTTGAGGGACGAACACCTGCATCTAGTTGAACTTAGCAAGGATAGGAATCTGTTAGAACGGGAGAAGTTACAATCTCAAGTAACGGAACTGAATGCGAAGGTTCAACAACAGACTGAAACTATAAACATGCTACAAAGACGAATAGCGCTCGAGGCGAAAAACTTCAAACACCAACTACAGAACGAAATGAACAAACACAAAGACACTAGACACGATCTGGATCTTGCTATAAGCAATGCTGATAAACTATCTACCATTATAGAGATGAAAGAGAAAATGCTTAGCACAGTCGCCGGTAGAGGGGGCAAGTCGCCAACAAAAGTAGCTTCAGTGACTAACATAAGATCGGTAAGCAAAAACACAAGAGACACGGGAAAAGTGAATGACGAACGTGACGGTGCGGTGCATATGGATCAAAATCTTTTAGCAAAGCTCTGTGAAAATTCACGAAACCTCAGTAGTTCGCTGTCACACGAAGAAGACACGAGCTCGTCAACCGAAACACGCTCGCGGTACGTCTCCAGCCGCACCTCCACCAGTAACAGAACAACCCCCAGTCAAAACAGAAGAGGTTCCAAAGGCTCCGTTGAGATAATTGAGTTGGCGAAAACTGTACAAGAAGGAATGGCGGACTTAGCGATAGTTGACGATACGTTCCAAGAATCGACACCTGAAGAGATGCAAAAGAAGATGGAAGCGATGAAGCAGGATCTTATGAATAAAATAAAAAATAACGAAGAACCCATATCACGGAAGTCCAGCGCCATAAGACGGAAGTCGACGGAAGAGTCCATCGAGGAAGAAATAATAGAAGTCATAGAACGACCCAAATCAAGAGGCAGGAGGAATTCTACGGTGTCATTTTACGACAGTTCAGACACGATCAACTATAGCACCGGCGACGATATAGAGAAAGTTCAGAATAAAAACCGCACAGAGCGGAAATTGTCGGGGAAACATATCGATAAGTACTGCAAAGACATCATCCAAGATATAGAGAAGAGCAGCAAGGTCATCGACGTGCACATGAAACAGTTCAGTCAGTCGAAGTTTGCCGGCGATAAGTTGGTGGAGCAGCTGCAGGCCGTGGACCAACTCAACCAATACGTCAACGGAAGCGGAGACATACCCGACGAGGCCTTCGCGGAGTTGAATAATAACTTCAAAATGCTCACCGACCAAGTGTTGAACGAAGCGCCCGTCGCCAAGAAGAGGCTGTCGAGCAGACGCGGCTCCCGAATAAATTCAGCCAATGATCTCCTCGGTGACGGGAATATGTCCAATCAGGATTTGCTCGACGATTTACTCGGAAAGAAATGA

Protein sequence:

>DPOGS210078-PA
MSVLSLVPEEKRKDQRSSLESVYSSNSRLNLLNKRKRYTPDMSTSSKGDKYVTQRVLSAKTHRVKQLQNQLADAHYHLQELSNENKVLRALQKKQEIALQRYENSNAELPQVLSSHSEEMRIQQCKYKQLKTQYKDITQKLKEREMQLQQLRDEHLHLVELSKDRNLLEREKLQSQVTELNAKVQQQTETINMLQRRIALEAKNFKHQLQNEMNKHKDTRHDLDLAISNADKLSTIIEMKEKMLSTVAGRGGKSPTKVASVTNIRSVSKNTRDTGKVNDERDGAVHMDQNLLAKLCENSRNLSSSLSHEEDTSSSTETRSRYVSSRTSTSNRTTPSQNRRGSKGSVEIIELAKTVQEGMADLAIVDDTFQESTPEEMQKKMEAMKQDLMNKIKNNEEPISRKSSAIRRKSTEESIEEEIIEVIERPKSRGRRNSTVSFYDSSDTINYSTGDDIEKVQNKNRTERKLSGKHIDKYCKDIIQDIEKSSKVIDVHMKQFSQSKFAGDKLVEQLQAVDQLNQYVNGSGDIPDEAFAELNNNFKMLTDQVLNEAPVAKKRLSSRRGSRINSANDLLGDGNMSNQDLLDDLLGKK-