Monarch geneset OGS2.0

DPOGS210875
Transcript  DPOGS210875-TA  1251 bp
Protein  DPOGS210875-PA  416 aa
Genomic position  DPSCF300027 + 1364557-1368117
RNAseq coverage  1574x (Rank: top 8%)
Annotation
Heliconius  HMEL021465  6e-53  83.78%
Bombyx  BGIBMGA007000-TA  9e-127  64.17%
Drosophila  CG8223-PA  6e-17  25.40%
EBI UniRef50  UniRef50_E2C4L8  5e-61  36.14%  Nuclear autoantigenic sperm protein n=2 Tax=Formicidae RepID=E2C4L8_HARSA
NCBI RefSeq  XP_392012.3  5e-64  37.97%  PREDICTED: similar to nuclear autoantigenic sperm protein (histone-binding) [Apis mellifera]
NCBI nr blastp  gi|328782862  9e-63  37.97%  PREDICTED: hypothetical protein LOC408464 [Apis mellifera]
NCBI nr blastx  gi|307181857  3e-69  39.24%  Nuclear autoantigenic sperm protein [Camponotus floridanus]
Group
Gene Ontology  GO:0005488  4.7e-08  binding
KEGG pathway 
InterPro domain  [202-238]  IPR019544  2.6e-09  Tetratricopeptide, SHNi-TPR domain
InterPro domain  [208-277]  IPR011990  4.7e-08  Tetratricopeptide-like helical
Orthology group  MCL14764  Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210875-TA
ATGGCTGAAGTACCAGAAATGAGCTCTTCTACTCCCGAAGAATTGCTGGCTGCGGGTCGACGACATTTAGCAGTGAGAGACTATACATCAGCTGCGGAAACACTCGCTACAGCCTGTGAACTGCTTGCAAAGAAACATGGTGAATTTGCAGATGAGTGTGCTGAGGCGTACTTATGGTATGGCAAATCTCTTCTTGGTTTGTCAAGAGAAGAAAATGGTTATTTGGGTGACGGAGTAGCTGGGGGAAATAATGAAGATGGGGACAATGAAGAGGAAAATGGTGAAGAAAATGCTGAAGCGGAACAGAAAAATGGAGAAATTGCAACAGAAGACGGCGATGATGAAGAGAGTAAAACTGAAGGGACAAAAGAAAATGATAAAACTGAACCTGAATCAACTACTAAGGACGAAGAACCTGGAAGTAGTACGGAAAATGAAGAAAAGCCAGGTACATCAAATGGGGATGTAAATGACGACTCCGTAGTTAATCTGGACAATGAAGACGATGTGGATAATTTACAACTCGCCTGGGAAATGTTGGATCTATCAAGAAACATTCTTCAGAAGCGGGCAGAAAATGGCAAGGATGTTCGAGCTTTATTGGCTGAAGTACATCTCGCTCTCGGAGAAGTTGCGCTTGAGAGTGAAACTTATGATAAGGCTGTTGCCGATATGATCAGCTGTTTGGACATACAGAAAGAACTCTACAAAAGTGATGACAGACACATTGCTGAAACCCACTATCAAATTGGTCTGGCAAATTCACTGGCATCGAACTTTGAAGATGCAATCACACACTTCAAAAACGCTGCAAACATTCTCGAAACTAGAATAAAAACTCTGGAAAATCCAAAGACAGTTTCAGAATGTGCCACAGTTAAGAAATACGCTACCACGGACCCATTTTATTCAGTGGAAGGTGAAATTAAGGAATTGAAGGAACTTCTACCGGAGATACAAGAGAAAATTCAAGATATGATGGATTATAAAGCCGAGGTAAGACCGTGCGACTGTGTCGAAGCTACACGAAAGATGCTTAGGAACATGATAATCAAGGAACAGAAACAGTGCAATGGCATTGAAACTATAAAAAGGGTACGCGAAACATTATGCCCTTCAAACGGAGAGAGCAGTAATGGCGCGGGTTCAAGCAAGACGGAACAAAAGGCATCAGACATCTCCCATCTCATAAAAAGAAAGAGAAAGAACAGCGAAGAAGGAGGTTCAGCACCGAAGAGGGTTAATACGTGA

Protein sequence:

>DPOGS210875-PA
MAEVPEMSSSTPEELLAAGRRHLAVRDYTSAAETLATACELLAKKHGEFADECAEAYLWYGKSLLGLSREENGYLGDGVAGGNNEDGDNEEENGEENAEAEQKNGEIATEDGDDEESKTEGTKENDKTEPESTTKDEEPGSSTENEEKPGTSNGDVNDDSVVNLDNEDDVDNLQLAWEMLDLSRNILQKRAENGKDVRALLAEVHLALGEVALESETYDKAVADMISCLDIQKELYKSDDRHIAETHYQIGLANSLASNFEDAITHFKNAANILETRIKTLENPKTVSECATVKKYATTDPFYSVEGEIKELKELLPEIQEKIQDMMDYKAEVRPCDCVEATRKMLRNMIIKEQKQCNGIETIKRVRETLCPSNGESSNGAGSSKTEQKASDISHLIKRKRKNSEEGGSAPKRVNT-
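
Consistency check (illustrative):

The transcript and protein records above can be cross-checked against each other. The sketch below is not part of the original record. It assumes the standard genetic code and that the trailing "-" in the protein sequence marks the stop codon. The `translate` helper and the short `head` and `tail` excerpts (the first 30 and last 15 nucleotides of DPOGS210875-TA) are introduced here only for illustration.

```python
from itertools import product

# Standard genetic code (assumed), with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds, stop_symbol="-"):
    """Translate an in-frame coding sequence, writing stop codons as stop_symbol."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# Excerpts copied from the DPOGS210875-TA record above.
head = "ATGGCTGAAGTACCAGAAATGAGCTCTTCT"  # first 30 nt
tail = "AGGGTTAATACGTGA"                 # last 15 nt, including the stop codon

print(translate(head))  # MAEVPEMSSS
print(translate(tail))  # RVNT-

# 416 residues plus one stop codon account for the 1251 bp transcript length.
print((416 + 1) * 3)    # 1251
```

Running the same function on the full nucleotide sequence should reproduce the full DPOGS210875-PA sequence if both assumptions hold.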