Monarch geneset OGS2.0

DPOGS214371
Transcript        DPOGS214371-TA  2418 bp
Protein           DPOGS214371-PA  805 aa
Genomic position  DPSCF300020 + 800278-808506
RNAseq coverage   4586x (Rank: top 3%)
Annotation
Heliconius      HMEL005013        0.0  99.13%
Bombyx          BGIBMGA003985-TA  0.0  98.03%
Drosophila      TER94-PC          0.0  88.36%
EBI UniRef50    UniRef50_G6CWA0   0.0  100.00%  Transitional endoplasmic reticulum ATPase TER94 n=8 Tax=Endopterygota RepID=G6CWA0_DANPL
NCBI RefSeq     NP_001037003.1    0.0  98.01%   transitional endoplasmic reticulum ATPase TER94 [Bombyx mori]
NCBI nr blastp  gi|112983322      0.0  98.01%   transitional endoplasmic reticulum ATPase TER94 [Bombyx mori]
NCBI nr blastx  gi|112983322      0.0  98.01%   transitional endoplasmic reticulum ATPase TER94 [Bombyx mori]
Group
Gene Ontology     GO:0016787  2.4e-248  hydrolase activity
                  GO:0005524  2.6e-46   ATP binding
                  GO:0005488  2.3e-41   binding
                  GO:0000166  1.5e-24   nucleotide binding
                  GO:0017111  1.5e-24   nucleoside-triphosphatase activity
KEGG pathway      aga:AgaP_AGAP005630  0.0
                  K13525 (VCP, CDC48)  maps-> Protein processing in endoplasmic reticulum
InterPro domain   [25-763]   IPR005938  2.4e-248  ATPase, AAA-type, CDC48
                  [239-368]  IPR003959  2.6e-46   ATPase, AAA-type, core
                  [20-104]   IPR009010  2.3e-41   Aspartate decarboxylase-like fold
                  [508-647]  IPR003593  1.5e-24   ATPase, AAA+ type, core
                  [23-104]   IPR003338  1.5e-22   ATPase, AAA-type, VAT, N-terminal
                  [125-187]  IPR004201  2.7e-09   Cell division protein 48, Cdc48, domain 2
Orthology group   MCL11924  Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214371-TA
ATGGCAGATAATAAGAGCCCTGATGATCTTTCTACCGCGATCCTGCGCCGTAAAGACAGGCCCAATCGTCTGATCGTCGAGGAGGCCGTCAGTGATGATAACTCGGTCGTCGCCCTATCACAGGGGAAAATGGAGCAGCTACAGTTATTCCGCGGTGACACTGTACTGCTTAAGGGCAAGCGTCGCAAGGAAACTGTATGCATCGTGCTCTCAGATGACAACTGCCCCGATGAGAAGATACGTATGAACCGTGTAGTTAGGAACAATCTGCGCGTTCGTTTGTCAGATGTGGTGTCTATAGCACCCTGTCCGTCAGTCAAATACGGCAAGCGTGTCCACATACTTCCAATTGATGATTCTGTTGAAGGCCTCACCGGTAATCTATTTGAGGTGTACTTGAAGCCTTACTTCATGGAGGCCTACCGTCCGATTCACCGTGACGACACGTTCATGGTGCGCGGTGGCATGAGAGCTGTGGAGTTCAAGGTGGTGGAGACAGACCCCGCTCCTTACTGTATCGTGGCCCCCGACACCGTCATTCATTGCGAAGGGGAACCTATTAAACGAGAAGAAGAGGAAGAAGCTCTCAACGCTGTCGGCTACGATGATATCGGCGGTTGTCGCAAGCAGTTGGCACAGATCAAGGAGATGGTGGAGCTGCCCCTGCGGCATCCCTCGCTGTTCAAAGCTATCGGCGTGAAACCTCCGCGCGGCATCCTCATGTACGGCCCCCCGGGGACAGGGAAGACGCTCATCGCTAGAGCCGTCGCCAATGAGACCGGTGCGTTCTTCTTCCTGATCAACGGCCCTGAGATCATGTCCAAGCTGGCGGGCGAATCTGAATCCAACCTGCGTAAGGCTTTCGAGGAGGCTGACAAGAATTCTCCAGCTATCATCTTCATAGATGAGTTGGATGCCATCGCTCCCAAACGAGAGAAGACACACGGGGAAGTCGAAAGAAGAATCGTGTCACAGCTGCTTACTCTTATGGATGGTATGAAGAAGTCGTCTCATGTGATAGTAATGGCCGCCACCAACCGTCCCAACTCGATCGACCCGGCGCTGCGGCGCTTCGGACGGTTTGATCGGGAGATAGACATCGGCATCCCTGACGCCACCGGGCGGCTCGAGATACTGCGCATTCACACCAAGAATATGAAGCTTGGAGACGACGTGGACCTAGAACAGATTGCAGCTGAATCTCATGGTCATGTGGGTGCCGATCTGGCTTCCCTGTGCTCGGAAGCAGCTCTGCAACAGATCAGAGAGAAGATGGACCTCATTGACCTGGAAGATGACCAGATTGATGCTGAAGTACTCAATTCCTTGGCTGTCTCCATGGATAACTTCCGTTATGCGATGACCAAATCCTCTCCATCGGCACTCCGTGAAACTGTGGTGGAAGTGCCCAACGTAACGTGGACTGACATCGGTGGTCTCCAGAACGTTAAGCGAGAGCTTCAAGAGCTGGTGCAGTATCCCGTGGAACATCCTGACAAGTTCCTTAAGTTCGGTATGCAGCCTTCCAGGGGTGTGCTGTTCTATGGGCCGCCGGGATGTGGTAAGACGTTGCTGGCTAAGGCGATTGCTAATGAGTGTCAAGCCAACTTCATCTCTGTCAAGGGACCAGAGTTACTCACTATGTGGTTTGGTGAATCCGAGGCCAATGTTAGAGACATCTTCGATAAGGCTCGTTCCGCGTCTCCGTGTGTGTTGTTCTTCGACGAGTTGGATTCCATCGCCAAGTCCCGCGGCGGGTCCGTGTCGGACGCCGGCGGCGCCGCCGACCGCGTCATCAACCAGATACTCACAGAGATGGACGGCATGGGCGCTAAGAAGAACGTGTTCATTATCGGTGCCACAAATCGTCCCGACATCATCGACCCGGCCATCCTCCGTCCCGGTCGTCTGGACCAGCTGATCTACATCCCTCTACCGGACGAGAAGTCCCGCGAGGCCATACTGAGGGCCAATCTCCGCAAGTCGCCCATAGCTAAGGA
CGTTGACCTATCCTACATCGCTAAGGTGACACAGGGCTTCAGTGGCGCTGATCTGACCGAGATCTGCCAGCGCGCCTGCAAGCTCGCCATCAGACAGGCCATCGAGGCGGAGATACACCGCGAGAGGGCGCGCCAGCAGTCACAACCCGCGGCCGTCATGGATATGGACGAAGAGGACCCGGTACCGGAGATCAGCCGCGCTCACTTCGAGGAGGCGATGAAGTTCGCGAGACGTTCTGTGTCCGACAACGACATCCGCAAGTACGAGATGTTCGCGCAGACGCTGCAACAGAGCAGGGGCTTCGGCACTAACTTCAGATTCCCTACAAGTGGTGCGTCAGCGGGCGGGACGGGAACGTCTGGGGGTGACCAGCCCACTTTCCAGGAGGAGGGGGGTGACGATGACCTCTATAGCTAA

Protein sequence:

>DPOGS214371-PA
MADNKSPDDLSTAILRRKDRPNRLIVEEAVSDDNSVVALSQGKMEQLQLFRGDTVLLKGKRRKETVCIVLSDDNCPDEKIRMNRVVRNNLRVRLSDVVSIAPCPSVKYGKRVHILPIDDSVEGLTGNLFEVYLKPYFMEAYRPIHRDDTFMVRGGMRAVEFKVVETDPAPYCIVAPDTVIHCEGEPIKREEEEEALNAVGYDDIGGCRKQLAQIKEMVELPLRHPSLFKAIGVKPPRGILMYGPPGTGKTLIARAVANETGAFFFLINGPEIMSKLAGESESNLRKAFEEADKNSPAIIFIDELDAIAPKREKTHGEVERRIVSQLLTLMDGMKKSSHVIVMAATNRPNSIDPALRRFGRFDREIDIGIPDATGRLEILRIHTKNMKLGDDVDLEQIAAESHGHVGADLASLCSEAALQQIREKMDLIDLEDDQIDAEVLNSLAVSMDNFRYAMTKSSPSALRETVVEVPNVTWTDIGGLQNVKRELQELVQYPVEHPDKFLKFGMQPSRGVLFYGPPGCGKTLLAKAIANECQANFISVKGPELLTMWFGESEANVRDIFDKARSASPCVLFFDELDSIAKSRGGSVSDAGGAADRVINQILTEMDGMGAKKNVFIIGATNRPDIIDPAILRPGRLDQLIYIPLPDEKSREAILRANLRKSPIAKDVDLSYIAKVTQGFSGADLTEICQRACKLAIRQAIEAEIHRERARQQSQPAAVMDMDEEDPVPEISRAHFEEAMKFARRSVSDNDIRKYEMFAQTLQQSRGFGTNFRFPTSGASAGGTGTSGGDQPTFQEEGGDDDLYS-