Monarch geneset OGS2.0

DPOGS215847
TranscriptDPOGS215847-TA2520 bp
ProteinDPOGS215847-PA839 aa
Genomic positionDPSCF300073 + 731323-733842
RNAseq coverage151x (Rank: top 53%)
Annotation
HeliconiusHMEL0074600.060.19% 
BombyxBGIBMGA002947-TA0.048.52% 
DrosophilaCG9247-PA1e-4928.10% 
EBI UniRef50UniRef50_UPI00020632981e-8833.33%UPI0002063298 related cluster n=3 Tax=unknown RepID=UPI0002063298
NCBI RefSeqXP_001861848.15e-8727.01%conserved hypothetical protein [Culex quinquefasciatus]
NCBI nr blastpgi|3287761615e-8833.33%PREDICTED: probable exonuclease mut-7 homolog [Apis mellifera]
NCBI nr blastxgi|2700033694e-9528.98%hypothetical protein TcasGA2_TC002596 [Tribolium castaneum]
Group
Gene OntologyGO:00036762.6e-45nucleic acid binding
GO:00084084.4e-313'-5' exonuclease activity
GO:00056224.4e-31intracellular
GO:00061394.4e-31nucleobase, nucleoside, nucleotide and nucleic acid metabolic process
KEGG pathway 
InterPro domain[287-575] IPR0123372.6e-45Ribonuclease H-like
[382-569] IPR0025624.4e-313'-5' exonuclease
[619-718] IPR0027821.7e-12Protein of unknown function DUF82
Orthology groupMCL15536 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215847-TA
ATGGACTTAAATAAGTTAGTCCATCAAAATCAAACAATAAAAATCATACCATCGTTAGAGGACTCTCTACGAGGACTTGGCCTGAATAGTGATCTAGATGAAGAAACTGAGATTTGGTTTAATCAATTGAAAATCAAATGGAAAACATGGAAGAAAAGCCCTACGATTGAGCGTCACTTTGACTCATTTTTTCAGTTCTGTCAAGATCCCTTCAGAGTTGCCCTAGTGTGCATTGTCAAATGTGATGAACCAAAAGATCGTAAGCCTAAATCTCTCTCTTATTGCATACTTGAAATTATATTTAAATGGTCCCAGACAAATGGCAGATTACCTGAAGAGACTCTGAAACTTCCAGCGTACAATATAGCAACACAACAGAGAAATCAACATTTTCTTTATTTAGCAGTTAAAACTTATCAATTGTATACTATAAAAGAGACTGTACTTCCTCTTGTAAAAGATATGATAAGAAATGATAATTGCAAACAAGCATCACAAATTGTAATTGCAATGGAACTCTTTGATGAAATTCCTGTTGAAGATTTACTGTTTCCATTGGTTTTGCAAGATACGCCAAACCTAATTGATGAATATTTGTCAGAATCTCCAAATCAAATTCAACCATTTTTATTATTTTTAGATAGACTGTTAGACAAAAACTTTAGTATAAGAGACTATGCTCAAAAATTTATTGAAGAAAATAAAATTTATAATGTTAAATATGATAAAATTCATTATAAACCTCTGGGAAAATTAGTCGCTCGGCTTTGTAATAAATTTAATGTTCCAATAGAGTCATGTAAAAACTTGAGTAAGAATCGTACCACAGGAGGGCTGAGGTATTTAATTCATCAGAGATATGTTGAACACAACTTGAGCCCCTCAGTTTGGGATGATTTGGTAAAAGATTCTTTGAAGCAAAGTACTGACTGTGCTAAAGAATTTGTTGATATGTTAGCAATGTATGACATAAATGAATCACTTAAGTGGTCTTCATATTTCGAAATATCAAATGATTGTCTTCCTCATGCTCTTCAGAATTTAACAATAAAAGATAATCCTATAGAAGAAGAAAATTGGGACTCAACTGACAATGCAGCTCAGAACTATTATAGACTTCCAATATCAGAAGAAAATATTTTAATTATTGACACAGCAGAAAAGTTTGATGAATTAATCTCAAAGTTGTCAAATTGTCCTATCATCAGCTTTGATTGCGAATGGAAACCATCATTTGGTGCTGCTAAATCTCGAATGGCTCTCATTCAAATTGGTACATTTGATCAAGTTTATCTTATTGACACTCTTATATTAAACAACAAGCAATACATGGGTAGTTGGTGCCGGTTTAATAAATATGTATTAGATAATGCGGAGATAATAAAATTGGGTTTTGGAGTTGAACAGGATCTGAATGAAATGAAGTCTTTAATTATTGGTTTGAATAATATCAAGGTTAAAGGTGAAGGACTTTTAGATTTAGGTTTACTGTGGAAAAATCTTGTCAAATGTGGCTTGTCATTACCAAGTAACAGTGATAATGGAGGTAACAGTCTCAGCTCTTTGGTCCAAACTTGCTTTGGATTGCCCTTGGAAAAATCTGAGCAATGTTCAAATTGGGAGTTAAGGCCCTTAAGAAATACTCAGATTCACTATGCTGCTTTGGATGCTTTTGTTTTGTTAGAGATATACAAATACCTTCAAAATCTTTGTGTAGAACAACATATTAATTTTGAGGAAATTTGTAATGATGTAATGTTGGATAGAAAACTGAAATGTCTAAAAAAGAATAAAGTAGTTGATTGTCTGCAGACAACAAAAAATATAAAGGTGAGAACTCCTATGGACGTTAAAATTCTTCTTGAACATGACAATGCACATTTACGATATTATCTAAGATACTGTGGTATTGACACTACTATTACAACTTCCCATATGTTATGGCACGATACTATTAAATTAGCCACATCTGAAAATCGTTTAATATTGACATCTAAATTGAAGTTTTCACCATCTAGCAGATTTTCACAAAACTTTATCTTAGATATAGGTAAAGGAAGCATCAAGGATCAATTATTAAAAATTCTTAAACATTTTAATGTGGGCCTTCAAAAGAATTATATTTTGACAAGATGTATAGAATGCAATTCTACAGATGTAAAATATTACTCTATTAATGATCTCAAAGATATATGTAGAAAATATAATGGTGGTAGCCACAAGTCTTCCGATCAGATCAGAAGGAGTGCTAGTGACAATGAAGATGATAATGATTATTCTGAAAACTTTCTCAGTGATTCAGAAGGGGAAGACATACATTTATACAAACCATTTCCAATACAGGACAAATGGTATACATCTAGCAGTGGAGCTAAAATTAATATGAATCAGATTGAAAAGTTATGTGCTTCCAATAAAACTTCACATATTTGTGAAAATTGTGGAAAACTATATTGCGATGAAGAGCCGTTGCTTAAATCAATACACGAAGTAATCATGTCTATAACAAATTTTAATTAG

Protein sequence:

>DPOGS215847-PA
MDLNKLVHQNQTIKIIPSLEDSLRGLGLNSDLDEETEIWFNQLKIKWKTWKKSPTIERHFDSFFQFCQDPFRVALVCIVKCDEPKDRKPKSLSYCILEIIFKWSQTNGRLPEETLKLPAYNIATQQRNQHFLYLAVKTYQLYTIKETVLPLVKDMIRNDNCKQASQIVIAMELFDEIPVEDLLFPLVLQDTPNLIDEYLSESPNQIQPFLLFLDRLLDKNFSIRDYAQKFIEENKIYNVKYDKIHYKPLGKLVARLCNKFNVPIESCKNLSKNRTTGGLRYLIHQRYVEHNLSPSVWDDLVKDSLKQSTDCAKEFVDMLAMYDINESLKWSSYFEISNDCLPHALQNLTIKDNPIEEENWDSTDNAAQNYYRLPISEENILIIDTAEKFDELISKLSNCPIISFDCEWKPSFGAAKSRMALIQIGTFDQVYLIDTLILNNKQYMGSWCRFNKYVLDNAEIIKLGFGVEQDLNEMKSLIIGLNNIKVKGEGLLDLGLLWKNLVKCGLSLPSNSDNGGNSLSSLVQTCFGLPLEKSEQCSNWELRPLRNTQIHYAALDAFVLLEIYKYLQNLCVEQHINFEEICNDVMLDRKLKCLKKNKVVDCLQTTKNIKVRTPMDVKILLEHDNAHLRYYLRYCGIDTTITTSHMLWHDTIKLATSENRLILTSKLKFSPSSRFSQNFILDIGKGSIKDQLLKILKHFNVGLQKNYILTRCIECNSTDVKYYSINDLKDICRKYNGGSHKSSDQIRRSASDNEDDNDYSENFLSDSEGEDIHLYKPFPIQDKWYTSSSGAKINMNQIEKLCASNKTSHICENCGKLYCDEEPLLKSIHEVIMSITNFN-