Monarch geneset OGS2.0

DPOGS206271
Transcript: DPOGS206271-TA  2844 bp
Protein: DPOGS206271-PA  947 aa
Genomic position: DPSCF300290 - 193570-212266
RNAseq coverage: 38x (Rank: top 73%)
Annotation
Heliconius: HMEL016898  3e-57  61.46%
Bombyx: BGIBMGA010748-TA  1e-59  70.67%
Drosophila: Nmnat-PA  8e-44  39.92%
EBI UniRef50: UniRef50_UPI00021A85D3  7e-46  46.74%  UPI00021A85D3 related cluster n=2 Tax=unknown RepID=UPI00021A85D3
NCBI RefSeq: XP_973580.1  3e-50  52.49%  PREDICTED: similar to nicotinamide mononucleotide adenylyltransferase 1 [Tribolium castaneum]
NCBI nr blastp: gi|91089959  6e-49  52.49%  PREDICTED: similar to nicotinamide mononucleotide adenylyltransferase 1 [Tribolium castaneum]
NCBI nr blastx: gi|91089959  3e-47  49.27%  PREDICTED: similar to nicotinamide mononucleotide adenylyltransferase 1 [Tribolium castaneum]
Group
Gene Ontology: GO:0016779  2.7e-89  nucleotidyltransferase activity
  GO:0009435  2.7e-89  NAD biosynthetic process
  GO:0009058  4.2e-19  biosynthetic process
KEGG pathway: tca:662389  8e-50
  K06210 (NMNAT)  maps-> Nicotinate and nicotinamide metabolism
InterPro domain: [4-733]  IPR005248  2.7e-89  Probable nicotinate-nucleotide adenylyltransferase
  [3-173]  IPR014729  1.9e-32  Rossmann-like alpha/beta/alpha sandwich fold
  [9-170]  IPR004820  4.2e-19  Cytidylyltransferase
Orthology group: MCL10620  Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206271-TA
ATGTCGCAAGGAAAAATTGTATTAATGGCTTGCGGCAGCTTCAGTCCGCCGACTTACATGCATTTACGAATGTTCGAAATAGCAAAGGATTATATTCACTCATTGGGTCTGGGTACGATAATTGGTGGAATCGTCTCACCGGTTCACGACGCATACGGTAAAAAGGATCTAGTCGCTGCACATCACAGAATCTCAATGCTGAAGCTGGCATTGCGTTCATCGGGATGGATTAAGGTTTCCGAGTGGGAGACCCAACAGGCTGGTTGGACGAGGACCAAGGTTTCCTTACAATATCATCAGGATGCCATAAACAACAACTTAACCGGCAACAATGACAACCCGCCATCTTGGTTGCCGGATGACATGTTGAATGTGAACAACATCGAGCCGCGGGATTTCAATAATAAATTGAACGAACGATTGAATGGCAACGCGGAGGATAGGGTGACGGTGAAGCTGTTGTGCGGAGCGGATCTGTTGGAATCATTCGCCACTCCCGGACTTTGGTCGGATGAAGATGATGCCATAAACAACAACTTAACCGGCAACAATGACAACCCGCCATCTTGGTTGCCGGATGACATGTTGAATGTGAACAACATCGAGCCGCGGGATTTCAATAATAAATTGAACGAACGATTGAATGGCAACGCGGAGGATAGGGTGACGGTGAAGCTGTTGTGCGGAGCGGATCTGTTGGAATCATTCGCCACTCCCGGACTTTGGTCGGATGAAGATATGGAAGCCATAGTTGGTCGCCACGGCTTGGTTGTGGTGAGTCGCGCGGGCTGCGATCCGGGGAGATTCATCTACGAATCGGACATGCTGTATAAATATAGGGATGCCATAAACAACAACTTAACCGGCAACAATGACAACCCGCCATCTTGGTTGCCGGATGACATGTTGAATGTGAACAACATCGAGCCGCGGGATTTCAATAATAAATTGAACGAACGATTGAATGGCAACGCGGAGGATAGGGTGACGGTGAAGCTGTTGTGCGGAGCGGATCTGTTGGAATCATTCGCCACTCCCGGACTTTGGTCGGATGAAGATATGGAAGCCATAGTTGGTCGCCACGGCTTGGTTGTGAGGAATGTTACTCTAGTGACGAATTACATAGCCAACGAGGTGTCCTCGACCGTCCTGAGGAGGTTGATGCGGCGCGGAGAGAGCGCCAAGTATCTGACTGAAGATAGCGTGCTGGCTTACATCAGGCAGAACTGTCTGTATGGAGCCGAGCCGTTTGTCACTGAGTATAACATACTTAATGACCTAATAGACAATTACGATAAGTCACCCCAAGACATAGTAATGGCGTCGCCGGAGGAGGCCAGCTTCAAGAACATACTGATATCGATCAGAGATAAACCGTCTATAGTCGACGAGACGATAACCGTGAAACGAAAGATAACCAACTTCCTTACACCGCACACCGACACGGTCAGCCCGGCGCAAGGCCCGAGACCAAAGATGGCCTACATAGAGAAGGCACCCAGCACATACATACCCGGGAAGGCCGTCAAGATCATAAGCGACAAGAAACAGCACAGACTAGAGGACGAGGTAAGTTGTGATAAGTACAGCTCGCTCGACAGCTACCTGGCCAAGGAGGAAGGCGACATCTACCAGCGGAGAGTCAGCGAGAGCAACATAACCAAAGAGAAGAAGAGGTGCTCGGCGTCGACTATCAGGAAACTGAAGTCCGATGACATGAAGAAGAGCAAGTCGGAGGATGCCATAAACAACAACTTAACCGGCAACAATGACAACCCGCCATCTTGGTTGCCGGATGACATGTTGAATGTGAACAACATCGAGCCGCGGGATTTCAATAATAAATTGAACGAACGATTGAACGGCAACGCGGAGGATAGGGTGACGGTGAAGCTGTTGTGCGGAGCGGATCTGTTGGAATCATTCGCCACTCCCGGACTTTGGTCGGATGAAGATATGGAAGCCATAGTTGGTCGCCACGGCTTGGTTGTGGTGAGTCG
CGCGGGCTGCGATCCGGGGAGATTCATCTACGAATCGGACATGCTGTATAAATATAGGAGGAATGTTACTCTAGTGACGAATTACATAGCCAACGAGGTGTCCTCGACCGTCCTGAGGAGGTTGATGCGGCGCGGAGAGAGCGCCAAGTATCTGACTGAAGATAGCGTGCTGGCTTACATCAGGCAGAACTGTCTGTATGGAGCCGAGCCGTTTGTCACTGAGTATAACATACTTAATGACCTAATAGACAATTACGATAAGTCACCCCAAGACATAGTAATGGCGTCGCCGGAGGAGGCCAGCTTCAAGAACATACTGATATCGATCAGAGATAAACCGTCTATAGTCGACGAGACGATAACCGTGAAACGAAAGATAACCAACTTCCTAACACCGCACACCGACACGGTCAGCCCGGCACAAGGCCCGAGACCAAAGATGGCCTACATAGAGAAGGCACCCAGCACATACATACCCGGGAAGGCCGTCAAGATCATAAGCGACAAGAAACAGCACAGACTAGAGGACGAGGTAAGTTGTGATAAGTACAGCTCGCTCGACAGCTACCTGGCCAAGGAGGAAAGCGACATCTACCAGCGGAGAGTCAGCGAGAGCAACATAACCAAAGAGAAGAAGAGGTGCTCGGCGTCTACTATCAGGAAACTGAAGTCTGATGACATGAAGAAGAGCAAGTCGGAGGTAAGTAAGCTGTGTGATAAGATGAAAAGCATTAAAATAAAGGAAACAAAGAACTATAAGACGAGGAGTTGCAATGACATCGTCAAGTTAATACTCACCAAACATGGCATTCATGTCATAAGCGACACAGAGGCCATTGTGTGA

Protein sequence:

>DPOGS206271-PA
MSQGKIVLMACGSFSPPTYMHLRMFEIAKDYIHSLGLGTIIGGIVSPVHDAYGKKDLVAAHHRISMLKLALRSSGWIKVSEWETQQAGWTRTKVSLQYHQDAINNNLTGNNDNPPSWLPDDMLNVNNIEPRDFNNKLNERLNGNAEDRVTVKLLCGADLLESFATPGLWSDEDDAINNNLTGNNDNPPSWLPDDMLNVNNIEPRDFNNKLNERLNGNAEDRVTVKLLCGADLLESFATPGLWSDEDMEAIVGRHGLVVVSRAGCDPGRFIYESDMLYKYRDAINNNLTGNNDNPPSWLPDDMLNVNNIEPRDFNNKLNERLNGNAEDRVTVKLLCGADLLESFATPGLWSDEDMEAIVGRHGLVVRNVTLVTNYIANEVSSTVLRRLMRRGESAKYLTEDSVLAYIRQNCLYGAEPFVTEYNILNDLIDNYDKSPQDIVMASPEEASFKNILISIRDKPSIVDETITVKRKITNFLTPHTDTVSPAQGPRPKMAYIEKAPSTYIPGKAVKIISDKKQHRLEDEVSCDKYSSLDSYLAKEEGDIYQRRVSESNITKEKKRCSASTIRKLKSDDMKKSKSEDAINNNLTGNNDNPPSWLPDDMLNVNNIEPRDFNNKLNERLNGNAEDRVTVKLLCGADLLESFATPGLWSDEDMEAIVGRHGLVVVSRAGCDPGRFIYESDMLYKYRRNVTLVTNYIANEVSSTVLRRLMRRGESAKYLTEDSVLAYIRQNCLYGAEPFVTEYNILNDLIDNYDKSPQDIVMASPEEASFKNILISIRDKPSIVDETITVKRKITNFLTPHTDTVSPAQGPRPKMAYIEKAPSTYIPGKAVKIISDKKQHRLEDEVSCDKYSSLDSYLAKEESDIYQRRVSESNITKEKKRCSASTIRKLKSDDMKKSKSEVSKLCDKMKSIKIKETKNYKTRSCNDIVKLILTKHGIHVISDTEAIV-
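
The annotated lengths above (2844 bp transcript, 947 aa protein) can be cross-checked with simple codon arithmetic. The following is a minimal sketch, not part of the original record: it assumes the transcript consists only of coding sequence (no UTRs) and ends in a stop codon, which is what the trailing "-" in the protein record appears to denote. The helper names `read_fasta` and `expected_protein_length` are hypothetical and were chosen for illustration.

```python
def read_fasta(text):
    """Parse FASTA-formatted text into a dict mapping record ID to sequence.

    Sequence lines belonging to one record are concatenated, so a sequence
    wrapped over several lines is returned as a single string.
    """
    records = {}
    current = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # The record ID is the first whitespace-delimited token after ">".
            current = line[1:].split()[0]
            records[current] = []
        elif current is not None:
            records[current].append(line)
    return {name: "".join(parts) for name, parts in records.items()}


def expected_protein_length(transcript_bp, has_stop_codon=True):
    """Residue count implied by a coding sequence of transcript_bp bases.

    Assumption: the whole transcript is coding sequence. If it ends in a
    stop codon, that codon contributes no residue.
    """
    if transcript_bp % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    codons = transcript_bp // 3
    return codons - 1 if has_stop_codon else codons


# Toy record (not the monarch sequence) showing that wrapped lines are joined.
toy_records = read_fasta(">toy-TA\nATGTCG\nCAATGA\n")

# Residue count implied by the annotated 2844 bp transcript length.
annotated_length = expected_protein_length(2844)
```

Under these assumptions, 2844 bp corresponds to 948 codons, one of which is the stop codon, which agrees with the annotated 947 aa. The same `read_fasta` helper could be applied to the two records above to compare the actual sequence lengths against the annotated ones.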