Monarch geneset OGS2.0

DPOGS209403
TranscriptDPOGS209403-TA1500 bp
ProteinDPOGS209403-PA499 aa
Genomic positionDPSCF300346 - 152681-171043
RNAseq coverage136x (Rank: top 55%)
Annotation
HeliconiusHMEL0150765e-0833.33% 
BombyxBGIBMGA013961-TA2e-1626.89% 
Drosophila% 
EBI UniRef50UniRef50_Q9BLH91e-0929.33%TRASSc4 protein (Fragment) n=2 Tax=Samia cynthia RepID=Q9BLH9_SAMCY
NCBI RefSeq%
NCBI nr blastpgi|125972222e-1028.80%TRASDJ [Saturnia japonica]
NCBI nr blastxgi|15491433e-1030.68%ORF1 [Bombyx mori]
Group
KEGG pathway 
InterPro domain[312-420] IPR0051351.6e-13Endonuclease/exonuclease/phosphatase
Orthology groupMCL23343 Specific divergent
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209403-TA
ATGCAGAAGGTGAAATTTTTGTGCAGGAACTTTTCCGCTTCTAATCAAGTCACTCGTAGCAACTTCACAAATATATTACTGCAACAAGTCAAGACTTATGCCGAGACTGGCACCCGGAGAGAGGGTCAACAGACGACTATAGATTGCAAGAACGCTAATGCAATGCCATCGGGTTTAGTATTACCAGCTGTTTTGGGTGAGGCCCGTAGCATCGACGAATGGGAGATAGGCAGGGCCCTTCCGGGTCTGAGTAGGACCCTGATCAGCCCCAAGACAGCAGCCGCCTCGACGTCGATACCGAAGAAGCCGGAGTCACCTCCAAAGGAGCAAACTCCTGGGGAAAAGAGCGTGAACAGGGCACAAGAAGCTAGGTCCGCAAAGATGTCGGCGACCAAGCTTATCGGCCTATCCAGGAACCTAAAGACTGAACTTAAAGAGGGTATTCTCCAAGCCATACAGAGGCTCTATGAGCTTGTGAGGGAGGGAGAAGAGGATAACCTCATTAAGCAGGATGAGCTGGATGAGCTGAGAGCAAAGTTAACTGCCCCAACTGACGCTACCTCGGCTCCCCAAGCAAACCCAGCCCAACAAACAGCGTTGATAAAGAGGCTCGAAGATCACAGTCTTCTGATGAAGGAGCACATTGAGGCCATAAAGGGCCTAAAAAAGCTAGGAGAAGACAAGCCCACTACCAAGGAAATACGAATAGAAGCAGGACCGTCAAATCAAAAACTGGAGGAGCACACGCTTTTACTGAAGGAGAACACAGAAACCTTAAAAACTCTTCAAGAACAGTTGAAAGGCTGTCGGGAGGAGCCCATCCGAGTTTGGGCTCTTTTTAAAGAGGAGGAGAGGGCGAGTAACGAGAAGCCATCGACGGCCGGATCCTCGATTTTTAGGACTCCAAAAATTTTTGGAAACTACCTCGATGAGGCGGATACATTACACTCATTAGAACATCTTCTCTCTCACAACACCAACACTCGCTGCATTATCGGCAGCGACTTTAATGGATGGCAACCACACTGGGGGAGCGACCGCACGAACGTAAGGGGGAACGACATCATGGAGTTCTCCCTTATCCACGCAATGCGGACAACATTCGAAACCATCACCCACAATCGCGTATGCTCCTCCATCATCGACATCACACTTGTTTCATCCACAATCTTTCACCAGATCACAGACTGGAAGGTGAACTTCTACGCATGTCCGTCTTCACAACACAATGCAATAGACTTCACATTCACACACCCGCACTCTCAAATCAAGTCACACACTCCTCAACATACCAACACATCCACCTTCCGATACAAGAACCACAAAGCGAGTTGGAGATTCTTCAAGGAAATAATTCTCTCACAACACAGCCTCAGTAATACGCTCGACACAGACATACCCGCACTAGACATGGATCAGCTTGATTCTTTCATCGATACCATCACCAAATCCATCCACACCGTCTGTAGGGCATTAATGCCCATCAGATCGTCCGGCTAG

Protein sequence:

>DPOGS209403-PA
MQKVKFLCRNFSASNQVTRSNFTNILLQQVKTYAETGTRREGQQTTIDCKNANAMPSGLVLPAVLGEARSIDEWEIGRALPGLSRTLISPKTAAASTSIPKKPESPPKEQTPGEKSVNRAQEARSAKMSATKLIGLSRNLKTELKEGILQAIQRLYELVREGEEDNLIKQDELDELRAKLTAPTDATSAPQANPAQQTALIKRLEDHSLLMKEHIEAIKGLKKLGEDKPTTKEIRIEAGPSNQKLEEHTLLLKENTETLKTLQEQLKGCREEPIRVWALFKEEERASNEKPSTAGSSIFRTPKIFGNYLDEADTLHSLEHLLSHNTNTRCIIGSDFNGWQPHWGSDRTNVRGNDIMEFSLIHAMRTTFETITHNRVCSSIIDITLVSSTIFHQITDWKVNFYACPSSQHNAIDFTFTHPHSQIKSHTPQHTNTSTFRYKNHKASWRFFKEIILSQHSLSNTLDTDIPALDMDQLDSFIDTITKSIHTVCRALMPIRSSG-