Monarch geneset OGS2.0

DPOGS210574
Transcript: DPOGS210574-TA; 1062 bp
Protein: DPOGS210574-PA; 353 aa
Genomic position: DPSCF300408 + 34824-44110
RNAseq coverage: 184x (Rank: top 49%)
Annotation
Heliconius: HMEL005799; 2e-64; 86.15%
Bombyx: BGIBMGA009756-TA; 1e-61; 66.29%
Drosophila: CG8080-PA; 2e-46; 52.30%
EBI UniRef50: UniRef50_D6WSU1; 2e-62; 41.85%; Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WSU1_TRICA
NCBI RefSeq: XP_974309.1; 2e-74; 41.69%; PREDICTED: similar to CG8080 CG8080-PA, partial [Tribolium castaneum]
NCBI nr blastp: gi|91086081; 3e-73; 41.69%; PREDICTED: similar to CG8080 CG8080-PA, partial [Tribolium castaneum]
NCBI nr blastx: gi|195455580; 1e-42; 52.87%; GK22965 [Drosophila willistoni]
Group:
Gene Ontology: GO:0003951; 6e-20; NAD+ kinase activity
               GO:0008152; 6e-20; metabolic process
KEGG pathway:
InterPro domain: [83-228] IPR016064; 6e-20; ATP-NAD kinase, PpnK-type
                 [273-334] IPR017437; 2e-18; ATP-NAD kinase, PpnK-type, all-beta
                 [110-153] IPR002504; 6.3e-06; Inorganic polyphosphate/ATP-NAD kinase, predicted
Orthology group: MCL13898; Single-copy universal gene

Nucleotide sequence:

>DPOGS210574-TA
ATGTCAGTGTTAAACAATTTTACAGTGTCTATAGCGAAAGGATTAAGACGTCTTCGTCACAATAAAGAGTTATGCGGCCATGGTTCAATCAGTCGCAAAGAAAACACTCGTCTCAAAATGGAAAAATGTCTCATAGTGTCAAAAGTTACCAGATACGAATATGAGAGACATATGCATGACAATATCGCGGATTCTGATCTGGAGGTCATCCTGCGGAAACGAGGGTCGGACTTCGACTCAATGATAGAGACCCACAGACAGCAGAAGGCTTTCGAGGAGAATGTGGCCAGCAGTTTGAAAAGTATGGGTCTGCAGGTGGAGATGGCCAGCCGTTTAACCTACAACGACGAGTTAATAAAGTGGTGTGACGTAGTGGTGCCGTGCGGTGGTGACGGAACCTTCCTACTGGCAGCATCAAGGGTCAGAGACGCCACCAAACCCGTTATAGGCTTCAACAGCGCTCCACACAAATCTGTCGGTAGACTGTGTTTACCCACCTGGTGTTCCAACGACGTGAAAGGTGCCTTCACAGCCCTAAAGGAGGTGTTTATAGGTGAGAGTGTGACGTCACGCGTGTCCCTCCTCAGACTACAAATTGACAACGGACAGTGGACACACACCAAGAGCTCGGGGCTTTGTGTCACAACTGGCACGGGGAGTACATCCTGGCATTACAGCATCAACTGTCTCCGGACCCATTCAGTGCTGGAGCTGATGCAGATACTAAATGAGGATTTTGATGTGAAGATGGATACAACGTTGGAGAAGGCGAGGGAGGTCGCTGAGAGATACAACCAGAAATTAATGTTCCCGCCAGATTCAGCGTATTTGGCGTATTCTGTTCGTGAGTACATCACGTTTGAGGAGTGGCCAGCGCCGCGCGGCCTCAGGGTGAGGGACAGGGCTGGCTGTGTCAGGGTCAAATCGCACTGTAGTGATGCTGGGCTGGTTATAGATGGCAGTGTATCGTTTCCATTCAACGATGGGACTAAGGCATTGCTAGAAGTCTACCCCGAAGATTCCTTGATGACAGTACAAATGGACGACTCTCTGCCTTATTAA

Protein sequence:

>DPOGS210574-PA
MSVLNNFTVSIAKGLRRLRHNKELCGHGSISRKENTRLKMEKCLIVSKVTRYEYERHMHDNIADSDLEVILRKRGSDFDSMIETHRQQKAFEENVASSLKSMGLQVEMASRLTYNDELIKWCDVVVPCGGDGTFLLAASRVRDATKPVIGFNSAPHKSVGRLCLPTWCSNDVKGAFTALKEVFIGESVTSRVSLLRLQIDNGQWTHTKSSGLCVTTGTGSTSWHYSINCLRTHSVLELMQILNEDFDVKMDTTLEKAREVAERYNQKLMFPPDSAYLAYSVREYITFEEWPAPRGLRVRDRAGCVRVKSHCSDAGLVIDGSVSFPFNDGTKALLEVYPEDSLMTVQMDDSLPY-
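
The reported lengths (1062 bp transcript, 353 aa protein) can be cross-checked against the two FASTA records above. The following is a minimal sketch, not part of the original record: the helper names `read_fasta` and `lengths_consistent` are hypothetical, and it assumes the transcript is a complete coding sequence that includes the stop codon, with the trailing `-` in the protein record marking that stop.

```python
def read_fasta(text):
    """Parse FASTA text into a dict of {record ID: sequence}."""
    records = {}
    current = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # The record ID is the first token after '>'.
            current = line[1:].split()[0]
            records[current] = ""
        elif current is not None:
            # Sequence lines may be wrapped; join them.
            records[current] += line
    return records


def lengths_consistent(transcript, protein):
    """Return True if the CDS length matches the protein length.

    Assumption: the transcript is a complete CDS ending in a stop codon,
    and a trailing '-' or '*' in the protein stands for that stop.
    """
    residues = protein.rstrip("-*")
    return len(transcript) == 3 * (len(residues) + 1)


# Toy records only; paste the full DPOGS210574-TA / -PA records to check them.
toy = read_fasta(">t1\nATGTCA\nTAA\n\n>p1\nMS-\n")
print(lengths_consistent(toy["t1"], toy["p1"]))
```

With the values listed in this record, 3 x (353 + 1) = 1062, so the stated transcript and protein lengths agree under the same assumption.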