Monarch geneset OGS2.0

DPOGS200608
TranscriptDPOGS200608-TA1755 bp
ProteinDPOGS200608-PA584 aa
Genomic positionDPSCF300076 - 241216-250084
RNAseq coverage501x (Rank: top 25%)
Annotation
HeliconiusHMEL0209790.082.92% 
BombyxBGIBMGA008967-TA3e-16176.84% 
DrosophilaCG10889-PA1e-10145.84% 
EBI UniRef50UniRef50_D6WUN23e-14453.31%Putative uncharacterized protein n=4 Tax=Pancrustacea RepID=D6WUN2_TRICA
NCBI RefSeqXP_973205.14e-14752.21%PREDICTED: similar to CG10889 CG10889-PA [Tribolium castaneum]
NCBI nr blastpgi|910879738e-14652.21%PREDICTED: similar to CG10889 CG10889-PA [Tribolium castaneum]
NCBI nr blastxgi|910879736e-14651.88%PREDICTED: similar to CG10889 CG10889-PA [Tribolium castaneum]
Group
KEGG pathway 
InterPro domain[201-355] IPR0218698.1e-61Ribonuclease Zc3h12a-like
Orthology groupMCL11335 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200608-TA
ATGTCAAACGAAACAGACCTGTCTATATCGGTTCCTCGGTCGTGCTTAACCGAATTTTACAGTAAATATCTCCATTTGGTGGAAACTTTTTACTATGTTACGCTTTCTGAAAAGAAATCCGATGTCGCGAGTGAGCAGTCGGAATATGTGTATTTCAAAGTAACTTTTAATTTAAATCCAGTTGAGAGCAATTTAATTCCAAGGGAACTTGTGGACAGAATTCAGAAATATGTGGAGTCATCATTGGGCTGGACGGGCGAACGTGGTGAAGATTCGAGCTATGACTCCGAGTGTGATGATGATGCCAGTCATCGGACCGCCTCTCGCACACCTTCAGACACGCTGGCGGCTGAATTTGCGGAGTACGTCACGCTTGCGCAACCACAACCCAATCAGGCGAAAATCGAGTTTGCCGTGAAACTGGGCTACTCGGAGCGGTTGGCTCGTACCGCGTTGCAGCGGCTCGGTTCAGATCCGCCTCGGAACGAGCTGTTGGCGGAGCTCATCAAGCTGGCTGCTAAGAGGCCTCCGGGGACCTCCTCCCCCCCACCTCCGACAGCCTCGCCTCCACCACCCACGGACGACCTCCCCGACAGACCTCTTAGACACATCGTTATTGATGGAAGCAATGTTGCCATGAGCCATGGCAACAAGGAAGTGTTCTCGTGTCGCGGCATCGAGATCTGCGTAGACTGGTTCAGAGCAAGGGGTCACAAGGATATAACAGTGTTTGTGCCGAAATGGAGGAAAGAAGCTTCCAGACCGGACAATCCGGTCGCCGACAGAGACGCTCTCGATAGATTAGAACGGAACCGAGTGCTGGTGTACACGCCCAGTAGACTCTTGGGAGGGAAACGTCTCATATGTTACGACGACAGATACATTCTGCGATTAGCGGCAGAGACGGACGGCATAGTCGTCTCAAACGACAATTACAGAGATCTAGCGGCGGAGAGCCCTGAATTCAGAAAAGTTGTGGAAGAACGCTTGCTCATGTTCTCCTTCGTCAACGACAGATTCATGCCACCGGACGATCCTCTCGGACGATCAGGACCTACTTTGGACACCTTCCTCCGAGCACCACCGTCCCGTAACGACGTTCGCTCATACGGTCCGTACGGCCCACCACCTGCTTGTCCTTACGGAAGGAAATGCACGTACGGGAATAAATGCAAATTCCATCATCCCGAACGAGCCGGACGGCCTCACAAATCCGTAGCCGAGAAGTTATCCGAACGTGCCGCTAGGAAGGCGAAAGATTTACATACTCTCAGTCTTCCACCGGGAGGTCGACCCGCGGACAGTAAAAGACCTCTGGCCAGAGCCCATTCTGCCACGCCGCGATTGGCTGACGCTTCGCTGATATCCAATTTCGACAGAGCCCAAATACCAGGACCCTCGCAGGCAGAGCATAACCCGCACCGCAAACTCGCCAGACAGCTGACACTCAACCCAACGTGTGACCCGCGACTCCACGCTACAGCGGTTAGAGTATCGTCTGCCCCAGTGGGCGCGGCCGCGGCCCTACATACTTACAACAACATTTCCCTCTCGCCGTGTGCCTCCGAGGGCGCCTTGCAGAACTGGGAGGCCCGACGTAGAATGCATTTCCATTTGGCCGGTGTGTTCCCTGAAGCTCAAGTCGCGGAAGCGATGGCCTCATATCCAGATGAAACTGACGCCAAGAAGATGTGCGCTATCATTTTAGACAGATACCGTCCGAGCGAGTCGCTACCACCACGGCCTACCCACTGA

Protein sequence:

>DPOGS200608-PA
MSNETDLSISVPRSCLTEFYSKYLHLVETFYYVTLSEKKSDVASEQSEYVYFKVTFNLNPVESNLIPRELVDRIQKYVESSLGWTGERGEDSSYDSECDDDASHRTASRTPSDTLAAEFAEYVTLAQPQPNQAKIEFAVKLGYSERLARTALQRLGSDPPRNELLAELIKLAAKRPPGTSSPPPPTASPPPPTDDLPDRPLRHIVIDGSNVAMSHGNKEVFSCRGIEICVDWFRARGHKDITVFVPKWRKEASRPDNPVADRDALDRLERNRVLVYTPSRLLGGKRLICYDDRYILRLAAETDGIVVSNDNYRDLAAESPEFRKVVEERLLMFSFVNDRFMPPDDPLGRSGPTLDTFLRAPPSRNDVRSYGPYGPPPACPYGRKCTYGNKCKFHHPERAGRPHKSVAEKLSERAARKAKDLHTLSLPPGGRPADSKRPLARAHSATPRLADASLISNFDRAQIPGPSQAEHNPHRKLARQLTLNPTCDPRLHATAVRVSSAPVGAAAALHTYNNISLSPCASEGALQNWEARRRMHFHLAGVFPEAQVAEAMASYPDETDAKKMCAIILDRYRPSESLPPRPTH-