Monarch geneset OGS2.0

DPOGS211029
Transcript        DPOGS211029-TA   1323 bp
Protein           DPOGS211029-PA   440 aa
Genomic position  DPSCF300004 + 1660588-1661910
RNAseq coverage   642x (Rank: top 20%)
Annotation
Heliconius      HMEL008042        0.0     97.27%
Bombyx          BGIBMGA006243-TA  0.0     93.86%
Drosophila      Vps4-PA           0.0     81.90%
EBI UniRef50    UniRef50_Q54PT2   2e-150  60.99%  Vacuolar protein sorting-associated protein 4 n=31 Tax=Eukaryota RepID=VPS4_DICDI
NCBI RefSeq     NP_001161188.1    0.0     93.86%  vacuolar protein sorting 4 [Bombyx mori]
NCBI nr blastp  gi|346230414      0.0     95.68%  vacuolar protein sorting-associating protein 4 [Spodoptera frugiperda]
NCBI nr blastx  gi|346230414      0.0     95.68%  vacuolar protein sorting-associating protein 4 [Spodoptera frugiperda]
Group
Gene Ontology    GO:0005524  1.1e-40  ATP binding
                 GO:0000166  2.4e-21  nucleotide binding
                 GO:0017111  2.4e-21  nucleoside-triphosphatase activity
KEGG pathway     phu:Phum_PHUM537990  0.0
                 K12196 (VPS4) maps-> Endocytosis
InterPro domain  [166-295]  IPR003959  1.1e-40  ATPase, AAA-type, core
                 [377-437]  IPR015415  3e-27    Vps4 oligomerisation, C-terminal
                 [3-82]     IPR007330  1.9e-26  MIT
                 [162-298]  IPR003593  2.4e-21  ATPase, AAA+ type, core
Orthology group  MCL11397 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211029-TA
ATGGCTTCATCTAATACATTACAAAAAGCTATTGATCTTGTCACTAAAGCAACTGAAGAGGATAAAAACAAAAATTATGAAGAAGCCCTACGACTTTACGAACATGGCGTAGAATATTTTTTGCATGCAGTTAAATACGAAGCGCAAGGGGAGAGAGCTAAAGAAAGCATTAGAGCAAAATGCTTGCAATATTTAGATAGAGCTGAAAAACTTAAGGAGTATCTCAAAAAGGATCGTAAGAAGAAACCAGTTAAGGATGGGGAATCAAACTCTAAGAGTGAAGATAAAAAGAGTGATAGCGACAGTGATTCTGATGACCCAGAAAAGAAAAAGCTGCAGGGAAAATTGGAGGGAGCAATTGTGGTTGAGAAACCCCATGTGAAGTGGAGTGATGTAGCTGGACTTGAAGCTGCTAAGGAAGCTCTGAAAGAGGCTGTCATTCTACCAATTAAATTCCCACATTTATTTACTGGCAAAAGAATCCCATGGAAGGGAATATTGCTTTTTGGCCCACCCGGTACTGGTAAATCTTACTTAGCCAAAGCTGTTGCTACTGAAGCAAATAATTCTACATTTTTTTCTGTTTCCTCATCTGACCTTGTATCTAAATGGCTTGGAGAATCAGAGAAATTAGTGAAAAATCTGTTTGAGTTGGCAAGACAGCATAAGCCAAGTATAATATTCATTGATGAAATTGACTCCCTTTGCTCATCCCGGTCGGACAATGAATCTGAATCAGCTAGAAGGATCAAAACTGAGTTTTTAGTACAGATGCAAGGTGTTGGTAATGACATGGATGGTATCTTAGTACTTGGAGCTACTAATATTCCTTGGGTTCTGGATTCTGCTATTAGGAGGAGATTTGAGAAACGTATTTATATAGCTCTCCCTGAAGAACATGCTCGTTTAGATATGTTTAAACTGCATCTAGGTAACACTCGACATATATTGACAGAACAAGATATGAAAACGTTAGCTACAAAATCTGATGGATACTCAGGTGCTGACATTAGTATTGTTGTTCGTGATGCTTTGATGCAGCCTGTGCGAAAAGTCCAGTCTTCAACACACTTTAAGAAAGTTTCTGGACCTAGCCCCACTGACCCTAATGTGATTGTGAATGATCTTTTAACCCCTTGTTCACCAGGAGATGCTGGAGCTATGGAAATGACCTGGATGGATGTACCAAGTGACAAACTTGCTGAACCGCCTGTAACTATGTCAGACATGCTTAGATCCCTTGCAACATCTAAACCTACTGTCAATGATGATGACATGATTAAATTGAAGAAGTTCATGGAAGATTTTGGCCAGGAAGGATAA

Protein sequence:

>DPOGS211029-PA
MASSNTLQKAIDLVTKATEEDKNKNYEEALRLYEHGVEYFLHAVKYEAQGERAKESIRAKCLQYLDRAEKLKEYLKKDRKKKPVKDGESNSKSEDKKSDSDSDSDDPEKKKLQGKLEGAIVVEKPHVKWSDVAGLEAAKEALKEAVILPIKFPHLFTGKRIPWKGILLFGPPGTGKSYLAKAVATEANNSTFFSVSSSDLVSKWLGESEKLVKNLFELARQHKPSIIFIDEIDSLCSSRSDNESESARRIKTEFLVQMQGVGNDMDGILVLGATNIPWVLDSAIRRRFEKRIYIALPEEHARLDMFKLHLGNTRHILTEQDMKTLATKSDGYSGADISIVVRDALMQPVRKVQSSTHFKKVSGPSPTDPNVIVNDLLTPCSPGDAGAMEMTWMDVPSDKLAEPPVTMSDMLRSLATSKPTVNDDDMIKLKKFMEDFGQEG-
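
The lengths and sequences in this record can be cross-checked against each other. The Python sketch below is illustrative only and is not part of the geneset: it assumes the genomic coordinates are 1-based and inclusive and that the standard genetic code (NCBI translation table 1) applies. The names span_length and translate are hypothetical helpers. It prints stop codons as "*", where the record uses "-".

```python
# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def span_length(start, end):
    """Length of a 1-based, inclusive coordinate range (assumed convention)."""
    return end - start + 1


def translate(cds):
    """Translate a coding sequence codon by codon; stop codons become '*'."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


# Genomic position from the record: DPSCF300004 + 1660588-1661910
genomic_bp = span_length(1660588, 1661910)

# Codons in the transcript minus the terminal stop codon
protein_aa = genomic_bp // 3 - 1

# First 18 and last 12 nucleotides of DPOGS211029-TA, copied from the record
first_codons = "ATGGCTTCATCTAATACA"
last_codons = "CAGGAAGGATAA"

print(genomic_bp, protein_aa)
print(translate(first_codons), translate(last_codons))
```

Under these assumptions the genomic span equals the 1323 bp transcript length, which gives 440 residues once the stop codon is excluded. The translated ends match the start (MASSNT) and end (QEG) of DPOGS211029-PA.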