Monarch geneset OGS2.0

DPOGS201616
TranscriptDPOGS201616-TA1596 bp
ProteinDPOGS201616-PA531 aa
Genomic positionDPSCF300152 + 420567-428498
RNAseq coverage86x (Rank: top 63%)
Annotation
HeliconiusHMEL0166853e-5934.38% 
BombyxBGIBMGA009447-TA3e-5533.75% 
DrosophilaCG14120-PA1e-4831.68% 
EBI UniRef50UniRef50_Q08JX11e-5934.03%Alkaline nuclease n=5 Tax=Obtectomera RepID=Q08JX1_BOMMO
NCBI RefSeqNP_001091744.12e-6034.03%alkaline nuclease [Bombyx mori]
NCBI nr blastpgi|2152694221e-6135.13%putative dsRNase [Spodoptera frugiperda]
NCBI nr blastxgi|2152694222e-6035.31%putative dsRNase [Spodoptera frugiperda]
Group
Gene OntologyGO:00468721.3e-30metal ion binding
GO:00167871.3e-30hydrolase activity
GO:00036761.3e-30nucleic acid binding
KEGG pathway 
InterPro domain[137-360] IPR0208211.3e-30Extracellular Endonuclease, subunit A
[148-382] IPR0016041.1e-29DNA/RNA non-specific endonuclease
Orthology groupMCL22593 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201616-TA
ATGACACTACCCAGGGCCTGTGTATTGTTGTTCGTCACTTTAAATGTTCTCAACGAAGCCGTTGGAGGCTGCATTCTATCGTTAAAAAAGGATTTTCCGAAGACCTCAGTGGTATATTTACGTCATGGCCGTCTCCTGTCACCGGACGCTGTCACCGGTGACGTTCAGCTGACACGTTCGGAAACACTCCAAGTAGCATGTCCGGGAGAACAAAAATACATTGTACTTAGCAATAAAACCACCAATCACAGCTTACTTGACGTGAAATGCGTTTCAGACAGCTTGTTCCGCGCGAGTAGACTGTCTTGGATAGGGAACTTCAGCGAGATCAGATGTAACGCGCCGCCCTGGACCAGCGCTGAGGAGGTCGGGGGATGCGGAGCAGGAGCCAAGAATTATAGGCACGAGATGATCGGGTATAAGGTGTCAGGTCATTTCCACGGCCTGTACGAGGCGTGTTTCAACAAAGATCTCCTCAGCACTCTCTATGTGAAACAGGAGCTGGGTCCAGAGAGCGTCTTCATTCAGTCGGGGGGTAGACCCAACTTTGTGGAACAAGATTTCTTCGGAAAAGTGAAGATGTCAAAACTCTACCACCTCACCAACCAGAAGGCGAGATTCAAGCAGGTCCTTGGCGAGGGGAAGGAGGAGGAATATCTGACTAAGAAACAGTATCTGACCCGCGGACATCTCTCCCCCCGGGCCGACCACTCCCTGCTCTGTTCTCAGCGCGCCTCTTTCCTGTACCTGAACACGGCTCCTCAATGGAGACGGGGCAACGCCGGTGATTGGGCCGCCTTGGAAGAGGCTCTGCGCCGCAGAGTCCACAGCTACGGCCGTCCGGTGACGGTGTACACTGGCACCTTCGGAGTATCGACCCTGGGGGACACCTCCTCGAGACAGCAGCAGCTCTATCTAAGCGTGGATCAGAACAATAACGGGATCCTGCCCGTCCCGCTGTATTATTATAAGGTTGTGTTCGACGCCGTCAACAACACGGCCGCGGCGTTCGTGTCGATCAACTCGTCTTACTACAACCAAACCATGATCGAGAAGCTGACCTTCTGCGAGGACATCTGCGGCAGCAGGAACTACTCCTGGCTGAGGTGGAGGTCCAGCGACGGCACACACAGCTTCTGCTGCGAGTACCATGACTTCGTCAAAACGGTCCATGACCTACCCGGATTGAAGGCTCTACGCCGCAGAGTCCACAGCTACGGCCGTCCGGTGACGGTGTACACTGGCACGTTCGGAGTATCGACCCTGGGGGACACTTCCTCGAGACAGCAGCAGCTCTATCTAAGCGTGGATCAGAACAATAATGGGATCCTGCCCGTCCCGCTGTATTATTATAAGGTTGTGTTCGACGCCGCCAACAACACGGCCGCGGCGTTCGTGTCGATCAACTCGTCTTACTACAACCAAACAATGATCGAGAAGCTGACCTTCTGCGAGGACATCTGCGGCAGCAGGAACTACTCCTGGCTGAGGTGGAGGTCCAGCGACGGCACGCACAGCTTCTGCTGCGATTACCATGATTTCGTCAAAACGGTTCATGACCTACCCGGTTTGAAGGTCGAAGGCTTATTTTATTGA

Protein sequence:

>DPOGS201616-PA
MTLPRACVLLFVTLNVLNEAVGGCILSLKKDFPKTSVVYLRHGRLLSPDAVTGDVQLTRSETLQVACPGEQKYIVLSNKTTNHSLLDVKCVSDSLFRASRLSWIGNFSEIRCNAPPWTSAEEVGGCGAGAKNYRHEMIGYKVSGHFHGLYEACFNKDLLSTLYVKQELGPESVFIQSGGRPNFVEQDFFGKVKMSKLYHLTNQKARFKQVLGEGKEEEYLTKKQYLTRGHLSPRADHSLLCSQRASFLYLNTAPQWRRGNAGDWAALEEALRRRVHSYGRPVTVYTGTFGVSTLGDTSSRQQQLYLSVDQNNNGILPVPLYYYKVVFDAVNNTAAAFVSINSSYYNQTMIEKLTFCEDICGSRNYSWLRWRSSDGTHSFCCEYHDFVKTVHDLPGLKALRRRVHSYGRPVTVYTGTFGVSTLGDTSSRQQQLYLSVDQNNNGILPVPLYYYKVVFDAANNTAAAFVSINSSYYNQTMIEKLTFCEDICGSRNYSWLRWRSSDGTHSFCCDYHDFVKTVHDLPGLKVEGLFY-