Monarch geneset OGS2.0

DPOGS200251
TranscriptDPOGS200251-TA1413 bp
ProteinDPOGS200251-PA470 aa
Genomic positionDPSCF300169 + 106863-110428
RNAseq coverage8x (Rank: top 85%)
Annotation
HeliconiusHMEL0106660.076.67% 
BombyxBGIBMGA000012-TA1e-14663.76% 
DrosophilaOsi9-PA2e-2034.88% 
EBI UniRef50UniRef50_B6DXB02e-4344.90%Osiris 9 n=2 Tax=Obtectomera RepID=B6DXB0_BOMMO
NCBI RefSeqNP_001129360.14e-4444.90%osiris 9 [Bombyx mori]
NCBI nr blastpgi|2095714548e-4344.90%osiris 9 precursor [Bombyx mori]
NCBI nr blastxgi|2095714549e-4644.90%osiris 9 precursor [Bombyx mori]
Group
KEGG pathway 
InterPro domain[21-247] IPR0124649.6e-34Protein of unknown function DUF1676
Orthology groupMCL26464 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200251-TA
ATGTGGAAAAATATAGTTTTTCTTGCCTTCGTGGCAACAGTGTATTGTAATCCAGTGGAACGCGGTATAGAAGAAAATTTGCTTGGTGCTGTTTCGGAATGCATAGACAAGGATACTTCGCTATGCTTGAAGGAAAAGGCATTGAAATTCACCGAAAAACTGTCTATCAGCAAGGATTTGAATATATTTGAAGGCATGTCCCTTATCAATACTGGTTCAGCTCGATCTGCTCGCAGTTTTGAACAACTTTCCGAAGACCCAAAGACTAGAGAAGTTCAAATTGAGGAGAGAATAGCCACCAATGTTGGAGACTTCCTCGATAATCATGTACTTCAGTTACGTCTCTCTGAAGATTCCGGTGAAGCCAGAGACTTGGACGAGGAAGGTCGTGGCAAGAAGAAGAAGAAGATCAAGAAGATCCTTCCTCTCCTTCTTCTCCTGAAGTTGAAACTTGCTGCTCTCATTCCCTTGTTCCTTGGAGTTATCGCTTTTGTGGCAATAAAGGCCGTCTTTCTAGGAAAGATTGCATTCGCTATGAATGCTTTTACTCTGATAAGGAAACTTTTGGCCAAGAACAATTCTGGTTCATCGGGCGGAGCTATAAATTATGCTCCACATCATCCTGAAGAACACCCTGGCTACTCCTACGAACCAGCTCAGGGTTGGAGTAGGAAAGTCAATGATGCCCAGAACATGGCTTATGCTGTGGCTGTAGTGAGCTCAGTTCCAGTGTCTCAAGAAGATTCAGTGTTTCGTTTGGTTGCTCAAAATTTTGTGAATTGCATGAACAGTGATTTGAATGTGTGCTTAAAGGAACACGCACTCAAGGCTGCAGAGCGACTTGGTACCGTCCGTAAATTGAATATCATTGATGGTGTTACTCTTTACAACAATGGACCTAAAGAAGCCAGGAGTTTTGAAGCTCTTTCAAGCGACCCAGAAGCTAGAAACAAACAGCTGACTGAACGTCTTTGGGACAGCACATCTGATCTGCTTCAAAAAAGCGAATTGGAATTTAGCTTTGCTGGCTCAGACGACGACGAGGATGATGAATCTAGGTCCTTCAATGAAGCCGAAGAAGGCCGTGGCAAGAAGAAGAAGCAACTGAAAAAGAAACTCAAACTTCTGATCCCTCTCGCAATCTTAGCTAAAGCCAAAGCTATCGCTCTCGTTGTAATAGCCCTCTTAGTTATCGCTGCATCTGTCTTCAAAATTGCCCTTTTGGCTAAGATAGCCTTCATTGCGAAGGTGATTGCTATAGTCAAGGCCCTTCTAGCCAAAAAGCACGCTCAAGAAGAGCACGGATGGGTAGCTCATGAAGAACATGGTCATCCATCTGCCGGATGGGAAGGAGGCTGGTCCCGGTCAAGGAACGAAGCCAATAGCTTGGCTTACTCCGCATACCAACAGTAA

Protein sequence:

>DPOGS200251-PA
MWKNIVFLAFVATVYCNPVERGIEENLLGAVSECIDKDTSLCLKEKALKFTEKLSISKDLNIFEGMSLINTGSARSARSFEQLSEDPKTREVQIEERIATNVGDFLDNHVLQLRLSEDSGEARDLDEEGRGKKKKKIKKILPLLLLLKLKLAALIPLFLGVIAFVAIKAVFLGKIAFAMNAFTLIRKLLAKNNSGSSGGAINYAPHHPEEHPGYSYEPAQGWSRKVNDAQNMAYAVAVVSSVPVSQEDSVFRLVAQNFVNCMNSDLNVCLKEHALKAAERLGTVRKLNIIDGVTLYNNGPKEARSFEALSSDPEARNKQLTERLWDSTSDLLQKSELEFSFAGSDDDEDDESRSFNEAEEGRGKKKKQLKKKLKLLIPLAILAKAKAIALVVIALLVIAASVFKIALLAKIAFIAKVIAIVKALLAKKHAQEEHGWVAHEEHGHPSAGWEGGWSRSRNEANSLAYSAYQQ-