Monarch geneset OGS2.0

DPOGS213765
TranscriptDPOGS213765-TA1362 bp
ProteinDPOGS213765-PA453 aa
Genomic positionDPSCF300212 - 155299-159418
RNAseq coverage9590x (Rank: top 1%)
Annotation
HeliconiusHMEL0178587e-17585.26% 
BombyxBGIBMGA009249-TA2e-15385.25% 
DrosophilaNap1-PA1e-7143.09% 
EBI UniRef50UniRef50_Q1HQB92e-15983.29%Nucleosome assembly protein isoform 1 n=3 Tax=Obtectomera RepID=Q1HQB9_BOMMO
NCBI RefSeqNP_001040201.12e-16083.29%nucleosome assembly protein isoform 2 [Bombyx mori]
NCBI nr blastpgi|1140520185e-15983.29%nucleosome assembly protein isoform 2 [Bombyx mori]
NCBI nr blastxgi|2905606554e-17683.24%nucleosome assembly protein isoform 1 [Bombyx mori]
Group
Gene OntologyGO:00056345.1e-111nucleus
GO:00063345.1e-111nucleosome assembly
KEGG pathway 
InterPro domain[5-357] IPR0021645.1e-111Nucleosome assembly protein (NAP)
Orthology groupMCL12519 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213765-TA
ATGGGTACTGTTGAGCGAACTGGTGATGCCACCTCCGAGGTGGAGTCAGGAGAGGAAGACGAAATCGTCGGCGGCGGTGAATTGGCAGCGCATTTAATGAGCAGCCGTATCACCCGCAATGAAATGCTAGCCGCCATAACGAATCGTCTGCATGCTGAAGCGATCGCCTCTCTGCCGCCCAATGTCCGTCGGAGAATCAGAGCTCTGAGGTCCTTGCAGAAGGAGTTTGTAGATATCGAGGCAAAATTCTACTCTGAAGTCCATGCTCTCGAGTGCAAGTATGAGAAAATGTATAAGCCGCTATTTGAAAAGCGAGCTCAAATTGTAAATGGTTCATATGAACCTACAGACAACGAGTGCTTGAACCCCTGGCGTGACGACTCTGAGGAGGAGGAATTGGCAAAGGCTGTGGAGAAGGCGGCCATCGCTGAGGGGGATGACAAACACGAAGACAACACACCCGCACAACCTCCGATGGACCCCAACGTCAAAGGGATACCAGACTTTTGGTACACCATCTTCAAAAATGTGTCCATGTTGTGCGAAATGATGCAAGAACACGATGAACCTATAATTAAAAGCTTACAAGATATTAAGGTGCAAATGCACAACGATCCCATTGGGTTCACACTAGAGTTCCACTTTGCTCCCAACGACTACTTCACGGACACGGTGCTAACAAAAGAGTATTACATGAAATGCAAACCCGAGGAGGACAACCCATTGGAATTCGAGGGACCCGAGATTTATTCATGCAAGGGTTGCAAGATTAATTGGAATAAGGGTAAAAATGTTACCGTGAAGACAATCAAAAAGAAGCAGAAACACAAGTCCCGTGGCTCGGTGCGTACAGTCACTAAGTCTGTTCAGGCCGACTCCTTCTTCAACTTCTTCGCTCCACCTGTCATGCCCGAAGATCCTAACTCCACCCTGGCCTCTGATACTCAGGCTCTTTTAACGGCGGATTTCGAGGTCGGTCATTACATTCGCGAGCGAGTGGTGTCCCGGGCCGTGCTCCTGTACACTGGCGAGGGCCTGGACGAAGACGACGACGATGACTATGAGGAGGAGCCATCTGGTCGAGCCCCGCTTGTCCGCATGTCGGTGTCGAGCTTGGTAGTTAATGTAGACGACAATCAGACGCAGCTACCGCTCGCTGCTAGGCCTCGGTCCGCGGTCGTCACCCGCGCCCGCGGCGACCGGCACCACCAGGCTGGGTATTTACGTCTGGTTGCCACCAAAAGTGTCTTTGGTTTCAGGAGGATTCGTATTCAGACGACGACTCCGGCACCGAGGAGGTTTCCGATGCCGAGGATTGATCGCGCCGCCGCCCCACGTAATGTCCGACAGTCTCGATCCTAA

Protein sequence:

>DPOGS213765-PA
MGTVERTGDATSEVESGEEDEIVGGGELAAHLMSSRITRNEMLAAITNRLHAEAIASLPPNVRRRIRALRSLQKEFVDIEAKFYSEVHALECKYEKMYKPLFEKRAQIVNGSYEPTDNECLNPWRDDSEEEELAKAVEKAAIAEGDDKHEDNTPAQPPMDPNVKGIPDFWYTIFKNVSMLCEMMQEHDEPIIKSLQDIKVQMHNDPIGFTLEFHFAPNDYFTDTVLTKEYYMKCKPEEDNPLEFEGPEIYSCKGCKINWNKGKNVTVKTIKKKQKHKSRGSVRTVTKSVQADSFFNFFAPPVMPEDPNSTLASDTQALLTADFEVGHYIRERVVSRAVLLYTGEGLDEDDDDDYEEEPSGRAPLVRMSVSSLVVNVDDNQTQLPLAARPRSAVVTRARGDRHHQAGYLRLVATKSVFGFRRIRIQTTTPAPRRFPMPRIDRAAAPRNVRQSRS-