Monarch geneset OGS2.0

DPOGS209310
Transcript: DPOGS209310-TA, 1218 bp
Protein: DPOGS209310-PA, 405 aa
Genomic position: DPSCF300234 - 156693-160701
RNAseq coverage: 481x (Rank: top 26%)
Annotation
Heliconius      HMEL005874         7e-76   64.62%
Bombyx          BGIBMGA013811-TA   1e-79   67.54%
Drosophila      CG12016-PB         2e-15   39.08%
EBI UniRef50    UniRef50_D6W6D7    4e-35   42.78%   Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6W6D7_TRICA
NCBI RefSeq     XP_970701.1        7e-36   42.78%   PREDICTED: similar to Nicotinamide riboside kinase 1 [Tribolium castaneum]
NCBI nr blastp  gi|380012295       5e-35   34.17%   PREDICTED: nicotinamide riboside kinase 1-like [Apis florea]
NCBI nr blastx  gi|91094371        2e-34   39.42%   PREDICTED: similar to Nicotinamide riboside kinase 1 [Tribolium castaneum]
Group
KEGG pathway: tca:659283, 2e-35
    K10524 (NRK1) maps-> Nicotinate and nicotinamide metabolism
Orthology group: MCL11711, Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209310-TA
ATGTCTTCAGAGTTAGTGCACACGGCGCAATTCGTAGATCAACAAGTTCCCTGTGACACACCAAAGCAAAGATACAAGAAACAGAAGATCACTTCTTCGTCTTCTGAAGAATCAGACAGAGGCACTAAGAAACTGCTCGGTCTAGCAAATTCTACCACCATAAGCTCTATACTTGATAAGATCATCGAGAGAGCGATCAAAGGAAAGCCGGAGAAAAAGATCGAAGTGAAAATAGATGAACACAAATTTGTGGATATATCAGCATATACCGGATCCAAAGTGGGTGTTAGTGTTCTGCGCTCTCTGGGGCTCGTTGTTTGTAGGAGGTCTCATGCTGTTCACCTTCTACCACGTGTACCCTCACTTATAAAGACTGACCTCCACAAACATCCACAACAGCAGTCAACACACCCCCGACCGAGCCAGTGCCTGTGTCATGTGCATAAAGCCTTACGTTGGGTTAAAGCATATTCATTAATAGGACATTTAGAGAATAAGAAGAGTTGGGGATGTCCCCGCCCGCCGTCCAACACTGACCCCTATAATAAACTAAGCGTATATATAATTTTAAGACTTCTGTGCAGTACGAGCGCGAACAGAGACTCGATCATGGCTAGCGAGTGGATCGTGATAGGCATCTCCGGTGTCACGTGCGGGGGGAAGACGACCCTCGCGGACAAGTTGAAGGAGGCTCTGCATCCGGTATACGTGTTCCACCAGGACAGGTACTTCTACAGCGACGACAGTCCCAAGCACGTGCGCTGCGAGGGCTTGGATCACAACAACTACGACATACTGAGCGCGCTCGACATGGACGCCATGTACCGAGATGTGATCAGCACTATGAGAGGGGTGGATCGCGCCCACAGCCAGGGTGGGCGGGCGGTGCCCGGGAAGTTACACGCGCCCGGGAAGAAGTTCCTCGTCATAGAGGGATTCACCGTGCTCAACTACACGCCCCTCATGGACATATGTGACTACAGGTACTACTTGGTTCTGGAGTACGGCGAGTGTTTCTCTCGGCGCGCGCTCCGCCTGTACGAGCCGCCCGACGTCGCCGGCTACTTCGAGACCTGCGTGTGGCCGGAACACCTCAAGTACAGGGCCCAGATAGAGCGTGACCCGCGTGTCACGATCCTAGAGGGCGCTGGCTCCGACCCATTCTACGTAGTGATGGCCGACTTGAAGAGAGGCGGAGCGAGGGAGATAGAGAATTAA

Protein sequence:

>DPOGS209310-PA
MSSELVHTAQFVDQQVPCDTPKQRYKKQKITSSSSEESDRGTKKLLGLANSTTISSILDKIIERAIKGKPEKKIEVKIDEHKFVDISAYTGSKVGVSVLRSLGLVVCRRSHAVHLLPRVPSLIKTDLHKHPQQQSTHPRPSQCLCHVHKALRWVKAYSLIGHLENKKSWGCPRPPSNTDPYNKLSVYIILRLLCSTSANRDSIMASEWIVIGISGVTCGGKTTLADKLKEALHPVYVFHQDRYFYSDDSPKHVRCEGLDHNNYDILSALDMDAMYRDVISTMRGVDRAHSQGGRAVPGKLHAPGKKFLVIEGFTVLNYTPLMDICDYRYYLVLEYGECFSRRALRLYEPPDVAGYFETCVWPEHLKYRAQIERDPRVTILEGAGSDPFYVVMADLKRGGAREIEN-
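
The transcript and protein records above can be checked against each other. The following is a minimal Python sketch, assuming the standard genetic code applies and that the trailing "-" in the protein record marks the stop codon. The function name `translate` and the constant names are illustrative and not part of the database. Only the first 24 and last 15 nucleotides of DPOGS209310-TA are used here to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered over T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as `stop_symbol` (assumed to match the "-"
    that ends the protein record). Trailing bases that do not form a
    complete codon are ignored.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        protein.append(stop_symbol if aa == "*" else aa)
    return "".join(protein)


# First 24 and last 15 nucleotides of DPOGS209310-TA, copied from the record.
cds_start = "ATGTCTTCAGAGTTAGTGCACACG"
cds_end = "GAGATAGAGAATTAA"

print(translate(cds_start))  # start of the protein record
print(translate(cds_end))    # end of the protein record, including the stop

# Length consistency: 405 residues plus one stop codon, three bases each.
print(3 * (405 + 1) == 1218)
```

Passing the full nucleotide sequence to `translate` should reproduce the whole DPOGS209310-PA record if the assumptions above hold.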