Monarch geneset OGS2.0

DPOGS215155
TranscriptDPOGS215155-TA1809 bp
ProteinDPOGS215155-PA602 aa
Genomic positionDPSCF300348 + 88-6934
RNAseq coverage20x (Rank: top 80%)
Annotation
HeliconiusHMEL0173722e-5459.02% 
BombyxBGIBMGA013922-TA6e-11257.98% 
DrosophilaIncenp-PA1e-1728.43% 
EBI UniRef50UniRef50_UPI00020647C14e-2135.41%UPI00020647C1 related cluster n=1 Tax=unknown RepID=UPI00020647C1
NCBI RefSeqXP_002063196.12e-1728.83%GK21798 [Drosophila willistoni]
NCBI nr blastpgi|3838557441e-2033.15%PREDICTED: uncharacterized protein LOC100880529 [Megachile rotundata]
NCBI nr blastxgi|3071976148e-4326.28%hypothetical protein EAI_14357 [Harpegnathos saltator]
Group
KEGG pathway 
Orthology groupMCL21035 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215155-TA
ATGAACTTTCATATAAACGCCGCTATAACAATAGAACCGCGCAGGAAGGTCGACAAACCGGATGACGTTAGCGACGGAGACACGGACAGGAGACGAGCGACCCGGAACAGGAAACAGACGGACACCATGACGCCACAGACAGCCGCGCGACCCGTCCGCTCGAGTCGTGCCAGACTCACAGAACAAGAAGACAGAGAAGGAAACAGGAGAACCAGGGGGAAGAAGGCTCAAGAAACATCAACCGTTGAGGCGGAAAAGGAAACTTCTGTCGAAAGCGTAACCGCGTCCCCGGCTGAGAAACCGCGGCCGAAACGTACGAGGAGAGCGAGAAAAGTTTCTGAAAAACAAGAAAACGACGATAAAACTGAAAACGACATTGAAAAAAAGGAAAGTGATTTGGAAATAAAACCAGTAGATACGTCAGACAGCGAGGCGCGGTCGCCCGTGCTGCAGACGATAGTGCCGAAGACGAAACTCAGGGTGCGCGAGGACTGCGGAGACGACGCAGACCAGCAGGGGCGGGGCGCGCGGGCCGCGGGCGAGCAGCCCCGGCGGCGGAAAATGAACGAGACGGTGGTGCTAGACAAGCAGAACAGAACAGAAGACAAGCGGAACGACGGGATAATGAACGAAACAGTGGTGTTGGAGAAAGATAAGGCTGCAGCTGAACCGGGCGACGACTCGCTCTTGACTGACGACGAATCGCTGGAAATGAAGACGCCGCCTAACAGACAGCTGCCGGAGCCCACTTCCGCTGTAAAAGAGAAAGTGCAACAGTTTGAAGAAATGGCTACGAGAGTGACCCGCACTAAGACTAGAGCTATGACTAAAAAGGAGGTCCCAGTGGACCCAGACACTCAGACGCCGCCGGACAGGACGAGGCCGGTTATATCGACGGACACCCTCAGCAAGATGAACAACCTCATATTCAACGGAAAACCACCGCAGATATCATCGTCGGCGTCGAAGCCTCGTTCTAACATCCCTATGAAGACTTCGGTAACAGCCTCCGCCTCTAAGATAAGTGTCGCCAGAGACGACGAGAGAAGAGAAAAAGAGGACGCGAGAAGGAAGAAGGAGGCGATACTAGAGGCTAAGAAGGAGATGCAACGAAGAAAGAGGGAAGAGAAAATGTCAGCGGCTGCAGCGGCTAGAACGGCGGCTGAGAACATGAGACGTGCAGCGCTTCAAGCAGCCGAGAAGGAAAGACGGGAGAGGCAGATACAGGCCGACCAGGGGAGGATGGATAGACTTAAAGAGGTCGAGAAGAAAAAGTTGGAGCAAGCACGTAAGGCTGCCGAGACAGAGGAACGGAGGAAGCTAGAGGAAGCTGCACGAGCCAGTAGACTGCAGAACGAACAAAGGAAAGTCGAGGAGGCTAGGAGGAGGCAGCTGGAGGAGGAAAAGATCATGAAGAAGGAAGCAGCTCAGATGCAGAAAGAGATAGAGCGGAGACAACGAGAGTTCATGGAGAGAATGAAGATGAAGAAATTAGAGGGAGACAGAACACCCAACAAGATGGCGGCCATAGAGCCCGTGTACATGCAGGACGGCTTCCAACACCTCAACTCCGATGAAGAAGAACCCCCGGAGAGACCACCACCAGTATGGAGCACCTCCAAGAACCGTCGCATTCAACTGTCGATCCAGTCCCGTATCAGCCAGCATCACATCGACCGTCTCTTCTCAGTGAGGGAGCACACTCCGGACCTAAGGGAGATCTTCCCTAACATAGAGCGAGCCCGCCTCAAGAGAACGTCCTCCGCCGTTTGGAGGACACCGCCCAGGCTGGCCACGCTCGACGAGTGA

Protein sequence:

>DPOGS215155-PA
MNFHINAAITIEPRRKVDKPDDVSDGDTDRRRATRNRKQTDTMTPQTAARPVRSSRARLTEQEDREGNRRTRGKKAQETSTVEAEKETSVESVTASPAEKPRPKRTRRARKVSEKQENDDKTENDIEKKESDLEIKPVDTSDSEARSPVLQTIVPKTKLRVREDCGDDADQQGRGARAAGEQPRRRKMNETVVLDKQNRTEDKRNDGIMNETVVLEKDKAAAEPGDDSLLTDDESLEMKTPPNRQLPEPTSAVKEKVQQFEEMATRVTRTKTRAMTKKEVPVDPDTQTPPDRTRPVISTDTLSKMNNLIFNGKPPQISSSASKPRSNIPMKTSVTASASKISVARDDERREKEDARRKKEAILEAKKEMQRRKREEKMSAAAAARTAAENMRRAALQAAEKERRERQIQADQGRMDRLKEVEKKKLEQARKAAETEERRKLEEAARASRLQNEQRKVEEARRRQLEEEKIMKKEAAQMQKEIERRQREFMERMKMKKLEGDRTPNKMAAIEPVYMQDGFQHLNSDEEEPPERPPPVWSTSKNRRIQLSIQSRISQHHIDRLFSVREHTPDLREIFPNIERARLKRTSSAVWRTPPRLATLDE-