Monarch geneset OGS2.0

DPOGS210177
TranscriptDPOGS210177-TA2157 bp
ProteinDPOGS210177-PA718 aa
Genomic positionDPSCF300393 + 43959-60077
RNAseq coverage113x (Rank: top 59%)
Annotation
HeliconiusHMEL0144900.087.67% 
BombyxBGIBMGA014139-TA0.081.64% 
DrosophilaCG11883-PC3e-13160.56% 
EBI UniRef50UniRef50_A8DY954e-12960.56%CG11883, isoform C n=16 Tax=Endopterygota RepID=A8DY95_DROME
NCBI RefSeqXP_974022.16e-15067.41%PREDICTED: similar to AGAP007730-PA [Tribolium castaneum]
NCBI nr blastpgi|910860671e-14867.41%PREDICTED: similar to AGAP007730-PA [Tribolium castaneum]
NCBI nr blastxgi|910860679e-14467.41%PREDICTED: similar to AGAP007730-PA [Tribolium castaneum]
Group
Gene OntologyGO:00167878.7e-192hydrolase activity
GO:00091668.7e-192nucleotide catabolic process
GO:00054881.2e-49binding
GO:00081526.4e-17metabolic process
GO:00164916.4e-17oxidoreductase activity
KEGG pathwaydme:Dmel_CG118832e-129 
 K01081 (E3.1.3.5)maps-> Purine metabolism
    Nicotinate and nicotinamide metabolism
    Pyrimidine metabolism
InterPro domain[335-691] IPR0061798.7e-1925'-Nucleotidase/apyrase
[2-264] IPR0160401.2e-49NAD(P)-binding domain
[529-692] IPR0083343.2e-415'-Nucleotidase, C-terminal
[5-22] IPR0023472e-17Glucose/ribitol dehydrogenase
[4-144] IPR0021986.4e-17Short-chain dehydrogenase/reductase SDR
Orthology groupMCL17436 Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210177-TA
ATGGCACAAAAAGTCGCAGCTGTTACCGGTTCGAATAAAGGACTGGGTTTCTTCATAGTTAAAAGACTGTGTCAGCACTTTGATGGAATTGTTTACTTACTAGCAAGGAATGAAGAAAGGGGATTGGAGGCGGTAAGAAAATTGAATAAAATGGGATTAAAACCTGAGTTTCATATATTGGATGTCAGTGATAAAGAAAGTATAAAGAAATTTGCTTATTTCATAAAAACAAAACATGGTGGTCTGAATGTTCTTGTAAATAATGCAGCCGTTATGGATTATAAAACGGTTTATCCGTCTTACGAAGGCGCCAAATACAATATCGACGTTAATTACAGAAGTTTACTAGACATTGAAAAATATTTATATCCTTTACTAAGAGATGGCGCTAGAGTTGTCAATGTGTCAAGCATGTGTGGCCACCTTTCCAATTTACGCAACAAAAAGTGGCTAGATTCATTAACAAAAGAAGATTTAGAAACTGAAGATATCAATAATTTTGTTGATGATTATTTGAATAGCGTAAAGAATGGAACTTTTAAAAAAGAAGACTTTGCTGATGAGGGTAAGCATGCAGAACATAGAGTATCTAAAATCGCTATGACAGCTTTGACAATGGTGCAACAAAGAAAATATAAGAATATATCTATAAACGCTATCTATCCCGGTTACTTAAAAACCGATATGGCACCCAAAGGTGTAAAAGATCCAGAAGAAGCAGCAGATGTTATTGTTTACCTTATTTTAGAAGCATCTCCTAATCTAAAAGGCACGTTTATGTGGGACAACAAAAAATTGGTTGATTGGTACGACGTTGATGTGAAATCAGTTCGCATAATGGCATCATTTGCGGCTCAGGCTGCACTACGATCTCGATGGAGTTCATTAATAGAGTCTACGAGTGTCGATGTCCCAGGAGAAGTGACTGATGGTGTCCGTTCCGTTGTTGGATGGCTCAAACAGGCACGAATTAATGTTGTTTGGATGTTTCAAACGGTTCCAGATTTCGGTCTAGACGTGCTGTCAAACCTGGTATCGCAGTGTAACTTCCCATGGCTGATGTCAAATGTTATAGACAACGAAACCGGGAGACCATTGGGGGATGGAAAAATCACTCACGCTCTGATGTGCAATGGTCATAAAATCGGAATGATTGGTCTCGTGGAACAGGAGTGGCTGGAAACTTTAGCGACGATTAACCCTGAAGAGGTAACGTTTATAGATTTCCTACAAGCGGGATCTAAATTGGCGTCACAACTTAAACAAGAGGGTTGTGAATACGTGATAGCACTAACACATATGCGAACTCCGAACGACATCAAACTGGCTGAGGGCTGTACTGACATCGACCTCATTTTGGGGGGACATGACCATGTGTATGAAGTTTTGGAGATAAACAACAAATACATAGTCAAAAGCGGCACCGATTTCCGACAGTTTAGCAAAATTAATATAAACTTCGGCACAGAGAGCGTCAAAGTGGACATCTCAGATGTTAAAGTCACCAGCAATATAGCCGAGGATCCCGTACTAAAAGGGAAAGTTGAGAAATACAGTGCTATGATAGACGGCAAGATGGATGAAGTTCTTGGCAAGTTTTGCGTTCCTCTAGAAGGAAGGTTTTCTGTCGTGAGACGTCAAGAGTGTAATTTGGGGAATTGGGTATGTGATGTTCTTCTAGCAGCCACTGGGGCCGACCTTTTATTACTCAACAGTGGTACCTTCAGATCGGATCAGGTCCATCCAGCAGGAGATTTCACTCTCAGAGATCTATCTACCATAATACCGATGCGAGATCCGCTGGTAGTAGTGGAAGCGTCTGGGGAGACCGTGATACAGGTTCTAGAAAATGCTGTCTCCAAATACCCCAGTCTCGAAGGAAGGTTTCCGCAGGTGGCCGGAATATCCTTTGCTTTTGACCCATCAAAACCTCCAGGTCAAAGGATCGCACAAGAGGTGATAAAAATTGGAGATGAATATCTACAGAAGGACCAGAAATACAGATTAGCAATAAAGCAGTATCTACACGAAGGGAACGACGGATTTAGCATGCTGAAGGATTGTCCCATTCTGAAATTACAAGAACTCCTTGCTGAGAAGGCCCGTTGGGAATCCGACTCGGTGATCAAAGAAGTTGACGATGAAAGTTCACCTTAG

Protein sequence:

>DPOGS210177-PA
MAQKVAAVTGSNKGLGFFIVKRLCQHFDGIVYLLARNEERGLEAVRKLNKMGLKPEFHILDVSDKESIKKFAYFIKTKHGGLNVLVNNAAVMDYKTVYPSYEGAKYNIDVNYRSLLDIEKYLYPLLRDGARVVNVSSMCGHLSNLRNKKWLDSLTKEDLETEDINNFVDDYLNSVKNGTFKKEDFADEGKHAEHRVSKIAMTALTMVQQRKYKNISINAIYPGYLKTDMAPKGVKDPEEAADVIVYLILEASPNLKGTFMWDNKKLVDWYDVDVKSVRIMASFAAQAALRSRWSSLIESTSVDVPGEVTDGVRSVVGWLKQARINVVWMFQTVPDFGLDVLSNLVSQCNFPWLMSNVIDNETGRPLGDGKITHALMCNGHKIGMIGLVEQEWLETLATINPEEVTFIDFLQAGSKLASQLKQEGCEYVIALTHMRTPNDIKLAEGCTDIDLILGGHDHVYEVLEINNKYIVKSGTDFRQFSKININFGTESVKVDISDVKVTSNIAEDPVLKGKVEKYSAMIDGKMDEVLGKFCVPLEGRFSVVRRQECNLGNWVCDVLLAATGADLLLLNSGTFRSDQVHPAGDFTLRDLSTIIPMRDPLVVVEASGETVIQVLENAVSKYPSLEGRFPQVAGISFAFDPSKPPGQRIAQEVIKIGDEYLQKDQKYRLAIKQYLHEGNDGFSMLKDCPILKLQELLAEKARWESDSVIKEVDDESSP-