Monarch geneset OGS2.0

DPOGS211635
Transcript: DPOGS211635-TA (1680 bp)
Protein: DPOGS211635-PA (559 aa)
Genomic position: DPSCF300325 - 73158-74956
RNAseq coverage: 242x (Rank: top 43%)
Annotation
Heliconius: HMEL006638 (E-value 0.0, 70.37%)
Bombyx: BGIBMGA011782-TA (E-value 0.0, 67.82%)
Drosophila: CG8368-PA (E-value 3e-115, 39.89%)
EBI UniRef50: UniRef50_UPI00021A708C (E-value 4e-124, 42.02%) UPI00021A708C related cluster n=2 Tax=unknown RepID=UPI00021A708C
NCBI RefSeq: XP_001607810.1 (E-value 2e-122, 43.07%) PREDICTED: similar to GA21025-PA [Nasonia vitripennis]
NCBI nr blastp: gi|383860490 (E-value 1e-123, 42.11%) PREDICTED: putative RNA exonuclease NEF-sp-like [Megachile rotundata]
NCBI nr blastx: gi|345491888 (E-value 2e-121, 42.37%) PREDICTED: putative RNA exonuclease NEF-sp-like isoform 1 [Nasonia vitripennis]
Group
Gene Ontology:
  GO:0004527 (E-value 2e-35) exonuclease activity
  GO:0005622 (E-value 2e-35) intracellular
  GO:0003676 (E-value 6.8e-26) nucleic acid binding
KEGG pathway: (none listed)
InterPro domain:
  [244-403] IPR006055 (E-value 2e-35) Exonuclease
  [227-400] IPR012337 (E-value 6.8e-26) Ribonuclease H-like
  [247-393] IPR013520 (E-value 8e-24) Exonuclease, RNase T/DNA polymerase III
Orthology group: MCL14061, Single-copy universal gene
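
The figures in this record can be cross-checked against each other. The sketch below is illustrative only and not part of the database: it assumes the InterPro ranges are 1-based, inclusive protein coordinates, and that the transcript length counts the coding sequence including its stop codon. The names `parse_range` and `domain_lengths` are hypothetical helpers introduced for this example.

```python
import re

# Values copied from the record above.
TRANSCRIPT_BP = 1680
PROTEIN_AA = 559
DOMAINS = {
    "IPR006055": "[244-403]",
    "IPR012337": "[227-400]",
    "IPR013520": "[247-393]",
}


def parse_range(text):
    """Parse a '[start-end]' string into a (start, end) tuple of integers."""
    match = re.fullmatch(r"\[(\d+)-(\d+)\]", text)
    if match is None:
        raise ValueError(f"not a coordinate range: {text!r}")
    return int(match.group(1)), int(match.group(2))


def domain_lengths(domains):
    """Return the length of each domain, assuming inclusive coordinates."""
    lengths = {}
    for accession, text in domains.items():
        start, end = parse_range(text)
        lengths[accession] = end - start + 1
    return lengths


# Every domain should fall inside the 559-residue protein.
for accession, text in DOMAINS.items():
    start, end = parse_range(text)
    assert 1 <= start <= end <= PROTEIN_AA, accession

# 559 residues plus one stop codon account for all 1680 bases.
assert TRANSCRIPT_BP == 3 * (PROTEIN_AA + 1)

print(domain_lengths(DOMAINS))
```

Under these assumptions the three domains overlap in roughly the same region of the protein (residues 227 to 403), which fits their all describing the same exonuclease fold.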

Nucleotide sequence:

>DPOGS211635-TA
ATGAGTGCTGAAAATGGGATATTCGAGGAGCCCGTCAGCAAAAAACGCCGTCTTGATTCTACTAAAGAGAAGCCTAAAACGAAGAAAAAGCCTATTCCTGTATTTCGATTAAAACCTACTGGAGAGACGGCATCATTGTTTTTGTCATCAGAGGAAAGAGTGCCTCTAGTTCTCACAGACATTCAACATTTACTATTACATTCGTTATTCGGAAACCTAAATCTTACCCAGCCACCACGTTGGTATACCATAGACAAATGTAACCACATCACCCAAACAACCTGTCTTATTATTGAAGGTATATCTATCCAGGACTGGGAAAACTATCAAGATATGCTGAAGAGTACACACAAAATTTTTGATAGTGCCATGGAAGTTTTAACTCCCTCTGTGTATAATGGATCCTTAGTTAAAGAAATGGCCTTAGTACCACTGTCTGAAAATGAAAAAGAAACTTTAATCCAAAAATATGGAAGCATGAATTTAGCTCTTGAGGTGAGAAAGGATTTGATGGTGATGATGAAAGCTGTTTTTCCTATCAGAGATGAAATAACAGGTTCGGTAGATGTAAGACGACTTGATGAAAAATTTCCAAGGACACAGCTTATGCTCTCAGCATGGCAGCTGATAGATGAAAACTATCCAGTTCCACTTAAAGGTAAATTACAGAATGTGTACGCTGATTATGTTATGACAAAAGAAGAGTATTCACCAGTTACAGCGGAATCACCCATGTTTGGTTTAGATTGTGAAATGTGTCTTACAAAAGCTGGTTCAGAGTTAACCCGAGTATCAATCGTCAATGAAAAACACGAAACTGTATATGAATCTTTTGTCAAACCCTACAACCAGATAATGGATTATCTAACTCAGTATTCTGGTATTACTGAGGAGTTATTGAGAGATGTAACAAAAAGGCTTGAAGATGTGCAGAAAGAAATACAAGAGCTTCTGCCCTCTGATGCAATATTGGTCGGACAGTCTCTGAATTCAGATTTACATGCCTTAAGATTAATGCACCCATATATCATTGACACAAGTTTGATCTACAATTTCACCGGAGAAAGATATCGCAAGCCGAAGTTAAAAACATTAGCAAAGGAATATCTCAAAGAGGAAATTCAGACCGGCACAGATGGTCACTGCTCAGTGGAGGATTCCCTTGCCTCCCTCAAACTTGTTCAGCTGAAATTAAGCAAGAGTGTCGAATTTGGTGATGCAGTACACACAAAGAGGCAAAAATACAAAGAAGAAGTCAATAAAATGGTCACCGAACCCCATTATGCTTTGTCCATTTTTAATCACATAATAGAACAGAAGAAAACCTCAGTAATCATCGGCTGTGACAATATAACAGGCGACTATCATACATTTTTGACCCAAGCAAAGGAAAGTTTAAGTACTCAGTTGAAGAAAGGTAAACCTAAGAGAGTTAAATTGAACACAGTGGATAGTATGGATGAAGTCATAACAACATTAACAGAATCAGTGAAAAATTATAACTTAGTCATGGGACACTTGAAATTGGAGGCGTCCGAAGATGATACAAAGTTAATGCAAACAGTTGATGGCTGGGTAGAAACTGTGTGGAACAGTATTCAAGAATCAGCTATTTGTGTTATTGTGTTCGGTGGAACTGTCAACGGTAATGGAGTAGCGATGATGAAGGTGAAAACTTAA

Protein sequence:

>DPOGS211635-PA
MSAENGIFEEPVSKKRRLDSTKEKPKTKKKPIPVFRLKPTGETASLFLSSEERVPLVLTDIQHLLLHSLFGNLNLTQPPRWYTIDKCNHITQTTCLIIEGISIQDWENYQDMLKSTHKIFDSAMEVLTPSVYNGSLVKEMALVPLSENEKETLIQKYGSMNLALEVRKDLMVMMKAVFPIRDEITGSVDVRRLDEKFPRTQLMLSAWQLIDENYPVPLKGKLQNVYADYVMTKEEYSPVTAESPMFGLDCEMCLTKAGSELTRVSIVNEKHETVYESFVKPYNQIMDYLTQYSGITEELLRDVTKRLEDVQKEIQELLPSDAILVGQSLNSDLHALRLMHPYIIDTSLIYNFTGERYRKPKLKTLAKEYLKEEIQTGTDGHCSVEDSLASLKLVQLKLSKSVEFGDAVHTKRQKYKEEVNKMVTEPHYALSIFNHIIEQKKTSVIIGCDNITGDYHTFLTQAKESLSTQLKKGKPKRVKLNTVDSMDEVITTLTESVKNYNLVMGHLKLEASEDDTKLMQTVDGWVETVWNSIQESAICVIVFGGTVNGNGVAMMKVKT-