Monarch geneset OGS2.0

DPOGS202778
TranscriptDPOGS202778-TA1320 bp
ProteinDPOGS202778-PA439 aa
Genomic positionDPSCF300018 - 1004402-1005721
RNAseq coverage290x (Rank: top 38%)
Annotation
HeliconiusHMEL0093000.089.75% 
BombyxBGIBMGA010499-TA0.082.76% 
DrosophilaCG5776-PA3e-13053.38% 
EBI UniRef50UniRef50_UPI00021A82865e-14755.88%UPI00021A8286 related cluster n=3 Tax=unknown RepID=UPI00021A8286
NCBI RefSeqXP_625214.23e-15056.53%PREDICTED: similar to spermatogenesis associated factor SPAF [Apis mellifera]
NCBI nr blastpgi|3800118917e-15056.98%PREDICTED: spermatogenesis-associated protein 5-like [Apis florea]
NCBI nr blastxgi|3800118912e-14256.98%PREDICTED: spermatogenesis-associated protein 5-like [Apis florea]
Group
Gene OntologyGO:00055242.4e-43ATP binding
GO:00001662.2e-18nucleotide binding
GO:00171112.2e-18nucleoside-triphosphatase activity
KEGG pathwayptr:4614746e-135 
 K13525 (VCP, CDC48)maps-> Protein processing in endoplasmic reticulum
InterPro domain[214-344] IPR0039592.4e-43ATPase, AAA-type, core
[210-346] IPR0035932.2e-18ATPase, AAA+ type, core
Orthology groupMCL12310 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202778-TA
ATGAAAGATTTGTTTGCAAAAGCGATAGCAAACGAACCAAGTATAATATTAGTAGATGAGATAGAAACGATATGCCCCCGCCATTCATCAGCCAGCACAGAACAGGAAAGACGCGTCACCTCCGCGTTTGTGTCTTTATTGGATAACCTACACCAAGATTCTAGCAGAGTATTTGTACTAGCTACTACAAGAAAACCAGAGGCAATTGATCCAATGTTGAGGAGATTCGGAAGGTTGGACAAAGAAGTAGAAGTACCCGTCCCCGACAGAAAACGACGCGCAGATATATTATACGCTTTACTCAAAAATCTTCCAAACAAAGTGTCATCACAGGACATGGAGGCGATATCAGACTTAGCTCACGGTTATGTCGCCGCCGACCTCGTGAACTTATGCTCTCAGGCCTCGATGAAATGTTTAAAAAGAATGAGTGAAGCCATAGAGAAAGAGGATTTGATAGGTGCTTTGACTGTGGTCCGGCCCAGTGCTATGAGAGAGCTTTTGATAGAGATCCCTAACGTGAGGTGGTCAGATATAGGAGGGCAGGACGGACTCAAGCTGAAACTGCGGCAAGCCGTCGAGTGGCCCTTGAAGCATCCTGAAAGTTTCCTGCGTTTAGGAATACGTCCGCCGGCTGGCGTCCTGCTCTATGGACCGCCCGGCTGTTCCAAAACTATGATAGCGAAAGCGTTAGCCACCGAGAGCGGACTCAACTTTTTATCAATAAAGGGCCCAGAGTTATTTTCAAAGTGGGTGGGCGAATCTGAACGTGCAGTGAGAGATCTCTTCACCAAAGCCCGACAGGTGGCTCCTTCGATAATATTCTTCGATGAAATGGATGCAATCGGAGGCGAGCGTGGCGCGGGCGAGGCCGGGGTGCACGAACGAGTCCTTGCACAGCTATTGACCGAATTGGACGGTGTGGTCCCATTAAATTCCGTCACTATTCTAGCTGCCACCAACAGACCCGATAGAATGGATCGGGCCTTACTCCGACCGGGTCGTATCGACCGACTCATTTACGTTCCTCTACCCGATTTCGAAACCAGGTTGCAAATTATAGAGTTGAAGCTGTCCAAAATGAGCACAAGCGACGATGTAAATCCACATGTATTGGCCATCAAGAGTGAAGGCTTCTCAGGAGCCGAGCTACACGCGCTGTGCCACGAGGCAGCCATGCGGGCCTTGGAGAAAGATCTCAATTGTCAAGAGGTTACCATGGAGCACTTCGAGCATGTGTTCAAAGACTTCAAACCTCGGACTCCAGATTCACTTCTGAAGATATACGAAGAGTTTTCACTGGGCCAACGCTCGGGGTGA

Protein sequence:

>DPOGS202778-PA
MKDLFAKAIANEPSIILVDEIETICPRHSSASTEQERRVTSAFVSLLDNLHQDSSRVFVLATTRKPEAIDPMLRRFGRLDKEVEVPVPDRKRRADILYALLKNLPNKVSSQDMEAISDLAHGYVAADLVNLCSQASMKCLKRMSEAIEKEDLIGALTVVRPSAMRELLIEIPNVRWSDIGGQDGLKLKLRQAVEWPLKHPESFLRLGIRPPAGVLLYGPPGCSKTMIAKALATESGLNFLSIKGPELFSKWVGESERAVRDLFTKARQVAPSIIFFDEMDAIGGERGAGEAGVHERVLAQLLTELDGVVPLNSVTILAATNRPDRMDRALLRPGRIDRLIYVPLPDFETRLQIIELKLSKMSTSDDVNPHVLAIKSEGFSGAELHALCHEAAMRALEKDLNCQEVTMEHFEHVFKDFKPRTPDSLLKIYEEFSLGQRSG-