Monarch geneset OGS2.0

DPOGS207467
TranscriptDPOGS207467-TA1986 bp
ProteinDPOGS207467-PA661 aa
Genomic positionDPSCF300051 + 35630-44439
RNAseq coverage70x (Rank: top 66%)
Annotation
HeliconiusHMEL0166852e-11967.66% 
BombyxBGIBMGA001173-TA3e-10756.13% 
DrosophilaCG14120-PA1e-5928.94% 
EBI UniRef50UniRef50_Q08JX12e-11064.61%Alkaline nuclease n=5 Tax=Obtectomera RepID=Q08JX1_BOMMO
NCBI RefSeqNP_001091744.13e-11164.61%alkaline nuclease [Bombyx mori]
NCBI nr blastpgi|3274204704e-11667.65%dsRNase [Mamestra configurata]
NCBI nr blastxgi|3274204703e-11967.65%dsRNase [Mamestra configurata]
Group
Gene OntologyGO:00468724.8e-39metal ion binding
GO:00167874.8e-39hydrolase activity
GO:00036764.8e-39nucleic acid binding
KEGG pathway 
InterPro domain[391-644] IPR0208214.8e-39Extracellular Endonuclease, subunit A
[404-643] IPR0016041.1e-37DNA/RNA non-specific endonuclease
Orthology groupMCL12709 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207467-TA
ATGACGCTAGCCGTCATTGCTCATCCCACCAGGATGGTTCCACCTTCAGAGATGGCTCTTATTCTTGGTGAAGAAGAGTTTGAGGACTACTTAGACAAATATCTAGCTATAGAACAAAAGAATTGGATTAATGCTACTTCCTCTCCTCGCAGTGGGTGTACTTTGAGAGTCAACGGTGACTTTGGTCAACCTCAACCTGTATATATCAACAGGAGGACTAATAATTATTTGGCAGCCAGTGGAAACTCAGGTCAAATACGTCTCGGTGGCGGAGAAGAAGTAATCGTTTCCTGTCCAAACAACCAAGCCATCCGCCATCCGAACATTGTATCTACAGTCCATACCGCTACCGCCAGGTGTGTTAGCAATAATTTATTTTCTGGAGCTGGATGGTTAAATGGTAACAGAGCTTTTGGACAACTAACTTGTGCTGGTCACGCTTTCTACCAAGCTCAGCAGACAAGTACAAGATGCTTCAACAACGGCGTTGTAATTCGCGTGGGCTTTTCTGTGAACAATCGATTTTATCCTCTCTACCAATCCTGCTACAATCAAAACCGTATGGAAGTTTTGTACGTATGGTACAACCAAAACGCTAACAATGCTGTGCATCAAAACGGAGTTGGTAGACCAAGTTGGTTAGTTCCCGTACCTCAATACTTCTATAAGGTGGTATACGATCAATCCAGACGTCGTGGTACAGCATTCGTCAGTATAAATAATCCTCATTACACTCTGGCGGAAGCTCGCAATTTGCAGTTTTGCACTGATCGTTGTCGCAATAACAACGCGTTCAGCTGGATCAACTGGAGGCCTGATCGCCTGGACCTGGGATATAGCTTCTGCTGCACGATCTCCGACTTCAGAAGAGTTATCGGTCACATACCGAACTTCGATGTTTCTAATGCGCTAATACTATCAGCGATGACGCTAGCCGCCATGGCTCATCCCACCAGGATGGTTCCACCTTCAGAGATGGCTCTTATTCTTGGTGAAGAAGAATTTGAGGACTACCTAGATACATATCTAGCTATAGAACAAAAGAATTGGATTAATGCTACTTCCTCTCCACGCAGTGGATGTACTTTAAGAGTTAACGGCGACTTTGGTCAACCCCAGCCTGTATATATCAGTAAGAGGACTAATAATTATTTGGCAGCCAGTGACAATTCAGAAATCATGTCTGTGGGCTTTTCTGTCAACAATGTATTTTATCCACTCTACCAATCCTGCTACAATCAAAACCGTATGGAAGTTTTGTACGTATGGTACAACCAAAACGCTAACAATGCTGTGCATCAAACCGGAGTTAGTAGGCCAAGTTGGTCAGCTGGAGGCTTCTTCCCTGGTGTCAACATAAACAACGTTTACACACAAGCTAGCCAAAAGACAGCTATTGCAAGACTAGTAGGAAATGCCTTAGCTGATAAATATGTAACGAACAATCAATTCCTTGCTCGTGGTCATCTTGCTGCTAAAAGTGACTACGTTTTCGCTACTGGACAACGTGCCACCTTCTTCTTCATCAACGCTGCTCCCCAATGGCAACCCTTCAATGCCGGCAATTGGAACAATCTGGAACTGAACCTGCGTGCACGTATTGGTAGAGCTAGATATAACACAGTAATTTACAGTGGAACATTTGGTGTTACCCAATTACGCAATTCTAACGGAAAAATGGTAAACATATTTTTGGACAATAACAGAGTTCCCGTACCTCAATACTTCTATAAGGTTGTGTACGATCAATCCAGACGTCGTGGTACAGCATTCGTCAGTATAAATAATCCTCATTACACTCTGGCGGAAGCTCGCAAACTGCAGTTTTGCACTGATCGTTGTCGCAATAACAATGCGTTCAGCTGGATCAACTGGAGGCCTGATCGCCTGGACCTGGGTTATAGCTTCTGCTGCACGATCTCCGACTTCAGAAGAGTTATCGGTCACATACCGAACTTCGATGTTTCTAATGGTCTTCTTTCTTAA

Protein sequence:

>DPOGS207467-PA
MTLAVIAHPTRMVPPSEMALILGEEEFEDYLDKYLAIEQKNWINATSSPRSGCTLRVNGDFGQPQPVYINRRTNNYLAASGNSGQIRLGGGEEVIVSCPNNQAIRHPNIVSTVHTATARCVSNNLFSGAGWLNGNRAFGQLTCAGHAFYQAQQTSTRCFNNGVVIRVGFSVNNRFYPLYQSCYNQNRMEVLYVWYNQNANNAVHQNGVGRPSWLVPVPQYFYKVVYDQSRRRGTAFVSINNPHYTLAEARNLQFCTDRCRNNNAFSWINWRPDRLDLGYSFCCTISDFRRVIGHIPNFDVSNALILSAMTLAAMAHPTRMVPPSEMALILGEEEFEDYLDTYLAIEQKNWINATSSPRSGCTLRVNGDFGQPQPVYISKRTNNYLAASDNSEIMSVGFSVNNVFYPLYQSCYNQNRMEVLYVWYNQNANNAVHQTGVSRPSWSAGGFFPGVNINNVYTQASQKTAIARLVGNALADKYVTNNQFLARGHLAAKSDYVFATGQRATFFFINAAPQWQPFNAGNWNNLELNLRARIGRARYNTVIYSGTFGVTQLRNSNGKMVNIFLDNNRVPVPQYFYKVVYDQSRRRGTAFVSINNPHYTLAEARKLQFCTDRCRNNNAFSWINWRPDRLDLGYSFCCTISDFRRVIGHIPNFDVSNGLLS-