Monarch geneset OGS2.0

DPOGS208417
Transcript: DPOGS208417-TA, 1527 bp
Protein: DPOGS208417-PA, 508 aa
Genomic position: DPSCF300241 + 152560-154086
RNAseq coverage: 3x (Rank: top 90%)
Annotation
Heliconius: HMEL020811, 9e-144, 50.69%
Bombyx: BGIBMGA004069-TA, 2e-134, 47.35%
Drosophila: CG33489-PA, 2e-10, 22.75%
EBI UniRef50: UniRef50_UPI0001758298, 1e-44, 28.87%, UPI0001758298 related cluster n=1 Tax=unknown RepID=UPI0001758298
NCBI RefSeq: XP_001815593.1, 2e-45, 28.87%, PREDICTED: similar to predicted protein [Tribolium castaneum]
NCBI nr blastp: gi|189237433, 4e-44, 28.87%, PREDICTED: similar to predicted protein [Tribolium castaneum]
NCBI nr blastx: gi|189237433, 8e-46, 28.87%, PREDICTED: similar to predicted protein [Tribolium castaneum]
Group
KEGG pathway 
Orthology group: MCL20411 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208417-TA
ATGTTTATTGATCGGAATAATAAAATATGTGCTGCTGGTAAACAAACTGATCAACCTCTAGAAAATATAAGCTCTTCATTAAAATGGTATAGCTTACAAGACGAAGTTGATGCGCTAATTAGTGATGCTTTTGAGCCTTCCACTTTGAAGACTGAATTTCCTCCAATAAAGGATATAAAAAAACGGGATATGCGACATGCAAGTATTTTTAGTGAGGGTTCTGAACTCATCAATCCCCCAGTTAAATCAAAAATGCAAACTCTGGTAGAAGACTTTAAAAATACATGTTATACATCATACTGGAAAAAAGAAGTAGGTAAAATTTCTGATCCTGTGCCTACACTCCCTGAAGGTTTTAATATTTATGGTACCGTATGTGGTTTGGAAAAGCGAACTCATGATCGTCTATATGATATAATTTTGCCAAAGAATCCTATTATCGATAAAACGCCGATATCAAAAAGTCCAGGCGTCCAGAAAAAACGCAATTACTGTTTTTTTAACAAAAAATCTACATTTGGAATAAAGTGTGAGGGAGATCGTACTGGGAAGTGTATGAGATGTTGTCTTACTGATGATAGGGTAAATTTAGGTACTGCATTAAAACAACCAATAGCTGACATACAGGCCAAATACAAAAAAGAAACAGCTCCAAAATTGTCGACATCTAGCATGCCTAATAATAATGCTAGTCGTGTTCCGGAAGGTTTTGCTTTTGGAAAAATACAGCCACCAGGAGATCAACTAGTAGAGTGTTTGAGGTCTTGTGAAGTAAACAAATATAAAGAATTACTTATACAATGCTTAGGTCATTTAAATACTGTAAGAAAATGTTTGAGTAAGCGCTTTGATGGATCATTTTTCTATCAATTTTATTTACAGTTAAAATTTTTAGATAAAGCGAATACGTGTTGGTTACCAAAACAAGTCGTTTATGACCAATGTAAAATAAGATTCATACATTTCCAGTCATCCTTAATTGAACCTCTCTTGTCAGTTTGGAAAGCGTTTGATGAAGTTAATAATAAAATTAAATATAAATCATTTATCCATGTAATTAATTATAGAGAGCCATCGCCTGAACTGCCAAAAATTTCTGATGTACCGGAAGAATATTTAGATTTCAGAACGACGTATGGAGATATGGTTAGAGAAAATCAAGTAATTGATACAAGGAACATGGCTGGTATTCCTTCAGGAAGATATTTGGACAAAGATTATCCCATTACCCCTGAAGGTTGCTGCAAAGCCGATAGAACTTATCTTCCTCATGAATCTGACGCTAGAACCTGTTTGCTTCCAAGTATACTCACGTGTTTAGGTCTAAGCCATCGTGATTTATACGCTCGACGTGATCGTAAAACAATTATGGAAGTATTTGAAAGAGCTGGTTATAAATTGGATGACGAAAAATTTAAAAAAGTTTGGGAAATGGCTGAGAAGTATCATTCACAAAAATGGGTTTGCTATGAAACCTTTCGGAAATGCTTATGCGATTATGAAAAGAATCAAGATAGAAAAACTTGA

Protein sequence:

>DPOGS208417-PA
MFIDRNNKICAAGKQTDQPLENISSSLKWYSLQDEVDALISDAFEPSTLKTEFPPIKDIKKRDMRHASIFSEGSELINPPVKSKMQTLVEDFKNTCYTSYWKKEVGKISDPVPTLPEGFNIYGTVCGLEKRTHDRLYDIILPKNPIIDKTPISKSPGVQKKRNYCFFNKKSTFGIKCEGDRTGKCMRCCLTDDRVNLGTALKQPIADIQAKYKKETAPKLSTSSMPNNNASRVPEGFAFGKIQPPGDQLVECLRSCEVNKYKELLIQCLGHLNTVRKCLSKRFDGSFFYQFYLQLKFLDKANTCWLPKQVVYDQCKIRFIHFQSSLIEPLLSVWKAFDEVNNKIKYKSFIHVINYREPSPELPKISDVPEEYLDFRTTYGDMVRENQVIDTRNMAGIPSGRYLDKDYPITPEGCCKADRTYLPHESDARTCLLPSILTCLGLSHRDLYARRDRKTIMEVFERAGYKLDDEKFKKVWEMAEKYHSQKWVCYETFRKCLCDYEKNQDRKT-
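
The figures in this record can be cross-checked against each other. The following is a minimal sketch, assuming the transcript is a single uninterrupted coding sequence translated with the standard genetic code (NCBI translation table 1); the record itself does not state which table was used. The function names `genomic_span`, `expected_protein_length`, and `translate` are hypothetical helpers introduced here for illustration, and only short fragments copied from the start and end of the nucleotide sequence above are translated.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with bases ordered T, C, A, G.
# Assumption: the record does not name its translation table.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def genomic_span(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate range."""
    return end - start + 1


def expected_protein_length(cds_length: int) -> int:
    """Residues encoded by a CDS whose final codon is a stop codon."""
    return cds_length // 3 - 1


def translate(cds: str) -> str:
    """Translate complete codons in frame 0; stop codons become '*'."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


# Coordinates and lengths taken from the record above.
span = genomic_span(152560, 154086)
residues = expected_protein_length(1527)

# First and last 15 bases of DPOGS208417-TA, copied from the record.
head = translate("ATGTTTATTGATCGG")
tail = translate("GATAGAAAAACTTGA")

print(span, residues, head, tail)
```

The genomic span equals the 1527 bp transcript length, which would be consistent with a single-exon gene, and 1527 bp corresponds to 508 codons plus one stop codon. Note that this sketch writes the stop as `*`, whereas the protein sequence in the record ends with `-`.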