Monarch geneset OGS2.0

DPOGS204287
TranscriptDPOGS204287-TA2523 bp
ProteinDPOGS204287-PA840 aa
Genomic positionDPSCF300046 + 212349-228142
RNAseq coverage1163x (Rank: top 11%)
Annotation
Heliconius% 
BombyxBGIBMGA007564-TA0.075.91% 
Drosophilal(3)82Fd-PV6e-16143.24% 
EBI UniRef50UniRef50_B9WZ570.071.03%BmOXR1 protein n=3 Tax=Obtectomera RepID=B9WZ57_BOMMO
NCBI RefSeqNP_001139127.10.071.03%oxygen resistance gene 1 [Bombyx mori]
NCBI nr blastpgi|2245869010.071.03%oxygen resistance gene 1 [Bombyx mori]
NCBI nr blastxgi|2245869010.071.03%oxygen resistance gene 1 [Bombyx mori]
Group
Gene OntologyGO:00169989e-11cell wall macromolecule catabolic process
KEGG pathwaybfo:BRAFLDRAFT_1248866e-18 
 K12587 (EXOSC6, MTR3)maps-> RNA degradation
InterPro domain[85-127] IPR0183929e-11Peptidoglycan-binding lysin domain
[84-127] IPR0024822.8e-10Peptidoglycan-binding Lysin subgroup
Orthology groupMCL16487 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204287-TA
ATGGCCTCGCTAGACTGTAGTGGTTGTGTTGCATGCAGGTCTAGGAGGGCTTCCCAGCAGCTATCCAGAAGTGCCGAGAGCATCGACCAGTCCTACAAGAGGTCAAAGAGCCGTTCTGTAGATCATGGGTTCGCAGCCCCCTTCGACCTGGAATCGCTGCGCTCCAAGGTGGAGGGCAGGTTCGACGCTGTTAATAAAGATGAAGACGAAAACAAGAGGCTGCCTCCGCCGCCTCCGATCAAAACAATGACCTATACGGTGAGAGATCGCGATACATTGACGTCCCTGGCAGCCCGCTTCGACACCACGCCCTCGGAGTTGACGAAGCTGAACCGTCTCGCCACGCAGTTCATCTTCCCTGGACAGAGGCTGCTCGTACCCGATAAGAGAAAAGATGGCGATAATGATGCCGATGTTCGATCTGACAGCGATAACACCTCGGAATCAACCCAAGCCGCCAGCGAACCGGCCCAGGAGAAAGAGATCTTGGACAGTCTCCGGCCTGTGTCCCCTGAGGCCCCCGCCAGCAGCGCCCCCCAGCGGTTCCTAAAGATCAACGTGAGACACATCACCGATGGACAGGGCGTGGTGTCCGGCGTGCTGCTGGTCACACCGAACGCGGTCATGTTCGACCCCAACGTGTCTGACACTCTGGTGATGGAGCACGGGCCTGAGTCGTACGGTGTGATAGCGCCCATGGAGTACGTCGTGAACGCTGCCATCTTCTACGACATAGCACACATGAGGACCCACGGACCGGACCACAATGCGCACAGAGCCGAGCAAGCCGAGATATACTATATGAATAAACCGGAACACGCCCCTGAAAAAGATCTAGTCAAAGAGGAGACATTCCCTGAGCTGCAGGCAGTTGGGGAGGCTGCGGAACCCCAGCAAACTACAGGGGAGGGAGCAGGGGAAGCCGTTGGGGGTGAGGAAGATACCCCTAGACGACCTTCCGACGCTGAGGAAAGAGGAGGAGCAGCCTTCCCAAAGGCCTTCGACAGGGACCTGGTTACCCCGACCCCAATACATCAACAGACAACTGAGGAGAGACGAAAGTCGCTACTGGATCATCACTGGGCTATACCGAGCAAGGACAGAATGTCCATCGATCAAGGGAGTGAGGTGTCCTCCAGCGTTGATCACGGCAAGGAGGAAAGTCTGGAGGTGTCAGTGGCAGAATGCTCTGAAGCGGAGCGCGCTGAGGCCGGGGAGCACGGAGAGGGTGAGGGGGAAGGGGAGGGAGATGGCAGCCAGCCTGAGGACGCCGCACACCTCGTCAAGCTTTCCTACCACGACTCTGGAATCGATATACGAGATCCGCTGTTACATGTCAACACGGCTAATTCCAAGAAGGAGTTGTGTTACATTTGTCACAATACCGACACCATTTACGAGTTGCCGGAGACCGTGATGTCGAGCGACTGGCACTGTGACACCACATATGAGAACCGGTTCTTGAACTGGTTCGCCGTTTGTCATGACAGTGACGCTGTGTCGTGTTCCACCGACGAGGTATACAGCGACGCGGATATAGTTCTGTCAGCGGACTGGGTGCCTCCGGTCTGTCTGCCTCGCGCTGAGCCGCCATTGTCTGCTCCCCCTCTTGGAGAGAATGAGAGAGCTCGGAAACCAGCAGCGGTCTCTTTCTCACTCGATGGCAATCAGAAGGAGGATCCGTCGAAACCGGAAAAGAAGAACAAGATGTTAAAACGCCTGTCCTACCCTCTATCGTGGGTGGAGGGCTTGACTGGTGAGGGCCAGGCTGGGTCGCAGGCGGACTCGCTGCCGACCTCCGCCGACAGTCACAACACCAGCGTCTTCTCTAAAGTCTTCAACAGCTCACCAATGAACCTGGTGGAGTTCGGTACTGGTTTGTTCCTGAGTAAGACGCCCTCGGAGGAGGGTCCCGCCGACGGGCGAAGCTCGCTGGGAGGGTTCGGTCGTTCCCACGCCAAGCCAGCCAGCGCTCAACAACCGAGACTCGACTACCGCAGCATGGTGTCGGTGGACGACATGCCGGATCTGTTCGCGTCGTTTGATAAGCTCATCCCTCGTCCCGCTCGTCCCAGCGATGATCCTCCCCTCTACCTCCGTCTCCGTATGGGCAAGCCGGCCGGTCGACCTCTGCCGCGTTCCACTCCGCTGATGTCCTACGGAAGGAAGAGGATGAAGCCAGAATACTGGTTCGGTATTCCCAGGAACAGGGTCGACGATCTCTTCAAGTTCCTGACCCACTGGGTCCCTGAGAGGTACGGACCTCTCCGTGACGTCACCGCTCACGGCTACGAGCTTATAGACAGCGACACGGAGTGGGACGACGATGACGCCAAGCCGGGACAGAAGGAGCGTATCGGCAGTACTGGAGATGTTTCGGATATAACGAGAGAATCCTGGGAGCTGCTGAAGGCGCCCTACGTGAAAATATATTCTATAATGAAGAGTCAGGCTGAAGCGCTCGGGGATTCACTAGGCGAAGAGCCGCAGACCGAGGTATCATTAGTAGGATGTGTTGTTTAG

Protein sequence:

>DPOGS204287-PA
MASLDCSGCVACRSRRASQQLSRSAESIDQSYKRSKSRSVDHGFAAPFDLESLRSKVEGRFDAVNKDEDENKRLPPPPPIKTMTYTVRDRDTLTSLAARFDTTPSELTKLNRLATQFIFPGQRLLVPDKRKDGDNDADVRSDSDNTSESTQAASEPAQEKEILDSLRPVSPEAPASSAPQRFLKINVRHITDGQGVVSGVLLVTPNAVMFDPNVSDTLVMEHGPESYGVIAPMEYVVNAAIFYDIAHMRTHGPDHNAHRAEQAEIYYMNKPEHAPEKDLVKEETFPELQAVGEAAEPQQTTGEGAGEAVGGEEDTPRRPSDAEERGGAAFPKAFDRDLVTPTPIHQQTTEERRKSLLDHHWAIPSKDRMSIDQGSEVSSSVDHGKEESLEVSVAECSEAERAEAGEHGEGEGEGEGDGSQPEDAAHLVKLSYHDSGIDIRDPLLHVNTANSKKELCYICHNTDTIYELPETVMSSDWHCDTTYENRFLNWFAVCHDSDAVSCSTDEVYSDADIVLSADWVPPVCLPRAEPPLSAPPLGENERARKPAAVSFSLDGNQKEDPSKPEKKNKMLKRLSYPLSWVEGLTGEGQAGSQADSLPTSADSHNTSVFSKVFNSSPMNLVEFGTGLFLSKTPSEEGPADGRSSLGGFGRSHAKPASAQQPRLDYRSMVSVDDMPDLFASFDKLIPRPARPSDDPPLYLRLRMGKPAGRPLPRSTPLMSYGRKRMKPEYWFGIPRNRVDDLFKFLTHWVPERYGPLRDVTAHGYELIDSDTEWDDDDAKPGQKERIGSTGDVSDITRESWELLKAPYVKIYSIMKSQAEALGDSLGEEPQTEVSLVGCVV-