Monarch geneset OGS2.0

DPOGS202169
TranscriptDPOGS202169-TA1149 bp
ProteinDPOGS202169-PA382 aa
Genomic positionDPSCF300162 + 97382-99911
RNAseq coverage158x (Rank: top 52%)
Annotation
HeliconiusHMEL0108801e-8747.44% 
BombyxBGIBMGA003312-TA2e-9957.24% 
DrosophilaCG6744-PA7e-8240.66% 
EBI UniRef50UniRef50_Q16LC61e-8648.43%3-5 exonuclease n=2 Tax=Culicinae RepID=Q16LC6_AEDAE
NCBI RefSeqXP_321572.42e-8743.42%AGAP001549-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|1571332604e-8648.43%3-5 exonuclease [Aedes aegypti]
NCBI nr blastxgi|1571332603e-8445.22%3-5 exonuclease [Aedes aegypti]
Group
Gene OntologyGO:00036768.3e-42nucleic acid binding
GO:00084083.7e-223'-5' exonuclease activity
GO:00056223.7e-22intracellular
GO:00061393.7e-22nucleobase, nucleoside, nucleotide and nucleic acid metabolic process
KEGG pathway 
InterPro domain[6-247] IPR0123378.3e-42Ribonuclease H-like
[43-210] IPR0025623.7e-223'-5' exonuclease
Orthology groupMCL15328 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202169-TA
ATGGCGAAGTCAAAAATCGTTATAACATCGGCGATCGCTGTAGGTTTTGTTGGTGTAACATACGTGATTTTGAAATATAAGATTAGATCTCGGAAAGCTGAAGATATCCTTAAAAATCTTGATATTAATATCGTAACAACAAAAGCTCAATGCGACGAAGTTGTAAATGAAATGCGGAGGAGGAGTACTTTACATCAGGCTATAGGTTTCGATTGTGAGTGGGTGACTGAAAATGGAAATAGACAACCCATAGCGCTTCTACAGCTGTCCACATTCGATGGATTCTGTGGGCTTTTAAGATTGAATCTTTTAAAAGAAGTTCCTATGTCACTGAAAGAACTTTTAGAGGATAAAAATATTTACAAAGTAGGCGTAGCGCCCATTGATGATGCAAAGTATTTGATTCAGGACTATTCAATTTATGTGAAGTCTACTTTGGATCTGAGGCATATAGTTGAACTAACCGGCCACACTGCTGGCGGGTTGGCAGCACTGGCTAACACATATTTAGGTATTGTGCTAGATAAGAATTGGAGGATCCGTTGCAGCGACTGGGCAGCAGAAGAGTTGACAGAACGGCAAATACATTACGCAGCGACCGACGCTTATGTAGCAATAAAAATCTTTGTTGACATTATAAACACATATAACAGAGACTTTTGGTTTTTCCTGTGGAGTAGAAACAGTGACAGACACTGGGGAAAAATACATGGTCTATGTTCACGATATGCCGATGTTGGTTACAAAGTGAAAAACATTAAAGGATCTCATAAGGAGAATAAAAAAAGTAAAGAGAAGACTATAACCAAGAAGTATTCTCACAGCCCTCGTTCGAGGCCGTTGTATCACAATTGTTTCATGGAAGCTCCTGATGGGGAAGTGCTATGTACATGTGATACGAAAAAAGCAAAATGGTATTTGGACAATAAACTAGCAACCCTAATAAAAGACGATCCCCTTACAATTCGTCTTGCATTCGAACCGGCCGGTCGGTCTGTGGGGGAAGTCGGTCGCTACTACACGCTCGTGAAGGAGAACAAGTGCGTTGTCTGTGGGGCCATGGACTCTTATATACGGAAAAATGTTGTACCCAGAGAATATAGGAAATATTTCCCAGGTAATTTGGTTTCCAATATATATGTGAAATGA

Protein sequence:

>DPOGS202169-PA
MAKSKIVITSAIAVGFVGVTYVILKYKIRSRKAEDILKNLDINIVTTKAQCDEVVNEMRRRSTLHQAIGFDCEWVTENGNRQPIALLQLSTFDGFCGLLRLNLLKEVPMSLKELLEDKNIYKVGVAPIDDAKYLIQDYSIYVKSTLDLRHIVELTGHTAGGLAALANTYLGIVLDKNWRIRCSDWAAEELTERQIHYAATDAYVAIKIFVDIINTYNRDFWFFLWSRNSDRHWGKIHGLCSRYADVGYKVKNIKGSHKENKKSKEKTITKKYSHSPRSRPLYHNCFMEAPDGEVLCTCDTKKAKWYLDNKLATLIKDDPLTIRLAFEPAGRSVGEVGRYYTLVKENKCVVCGAMDSYIRKNVVPREYRKYFPGNLVSNIYVK-