Monarch geneset OGS2.0

DPOGS202233
Transcript        DPOGS202233-TA  957 bp
Protein           DPOGS202233-PA  318 aa
Genomic position  DPSCF300149 + 559133-561038
RNAseq coverage   437x (Rank: top 28%)
Annotation
Heliconius        HMEL009175        2e-80   83.54%
Bombyx            BGIBMGA013532-TA  3e-100  53.58%
Drosophila        Vps25-PA          2e-45   50.00%
EBI UniRef50      UniRef50_E2C8N0   2e-52   56.96%  Vacuolar protein-sorting-associated protein 25 n=8 Tax=Endopterygota RepID=E2C8N0_HARSA
NCBI RefSeq       XP_395839.1       2e-54   62.03%  PREDICTED: similar to vacuolar protein sorting 25 [Apis mellifera]
NCBI nr blastp    gi|340713990      4e-57   62.03%  PREDICTED: vacuolar protein-sorting-associated protein 25-like [Bombus terrestris]
NCBI nr blastx    gi|340713990      1e-55   62.03%  PREDICTED: vacuolar protein-sorting-associated protein 25-like [Bombus terrestris]
Group
Gene Ontology     GO:0035299  6.4e-13  inositol pentakisphosphate 2-kinase activity
                  GO:0005524  6.4e-13  ATP binding
KEGG pathway      ame:412381  5e-54
                  K12189 (VPS25, EAP20) maps-> Endocytosis
InterPro domain   [161-318]  IPR008570  7.9e-86  ESCRT-II complex, vps25 subunit
                  [247-318]  IPR014040  7.8e-31  ESCRT-II complex, vps25 subunit, C-terminal winged helix
                  [161-246]  IPR014041  2.3e-26  ESCRT-II complex, vps25 subunit, N-terminal winged helix
                  [6-163]    IPR009286  6.4e-13  Inositol-pentakisphosphate 2-kinase
Orthology group   MCL11897  Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202233-TA
ATGCCTTTAATTGAAAATGTAAATGATTTTTTAGATTTAATTATTGAAATATTACTCAGTGATGGAAAATCTGATATAATTTTGCACAAGTCTACCGATGATATGAACTCAGATTCTACATTGGGTTGTGTTGAAGAAAGTAGCCCATATTCGAACAGTCTTCTTGACAGGTTACTCCGTGTTCAAAAACTATCCGAGAACTTTGATTATTGCTGCCCCCCAACAAGAGATTCACAAGAATATGTGTCATTTCTACTAAACGTGTTAGACGATGAACAATTAGATTTAACAAATATGGCTGATAGAGAACGTTTTTTAAGTCACATAGACTCGAGTCACCTTGCCCTTATATCAGCTGTAGCAAAAGATTGCTCCATCATGATCACATTCACCAAGAAATCCCATGAAAATCTTCCGACAATAAGAATTGGGGATGAAGCAATTGTATACAAGATATCTATAACAGATTTAGAACCCAAAATCCAGCCGCATTCAGAAACAAGATCCAAACAATTACAAGCATGGCAAGAACTTATATCGGAATATTTGAAAGTAACAAAACAATCCACCATAGATGTAAGAGAATCACAGAACAGTCCGCTGTTTAATAATACAGCAATCAACAGGAAGCTGTCTCCGGAGGCAGTTCTGACGGTTTTAGAGGATATGGCGAAATCTGGTAAAGCTGCACCTATAGATAAAAGCAAGAATGTGTGGGAAGTGTACTGGCATTCACTGGACGAGTGGGGTAACATGATATATAGCTGGGCGAGCAGTAATGGGTTAAACAACACCGTTTGTACATTGTATGAGCTCAGAGAGGGCGACAACACTGTTGGTGAAGAGTTCCACGGTCTGGACATGAATGTTTTGTTAAAAGCACTGAAGGCGTTATCAAGCAACGGCAAATGTGAGCTGATAGAGTTTGATGACAACCAAGGAGTGAAATTCTTCTGA

Protein sequence:

>DPOGS202233-PA
MPLIENVNDFLDLIIEILLSDGKSDIILHKSTDDMNSDSTLGCVEESSPYSNSLLDRLLRVQKLSENFDYCCPPTRDSQEYVSFLLNVLDDEQLDLTNMADRERFLSHIDSSHLALISAVAKDCSIMITFTKKSHENLPTIRIGDEAIVYKISITDLEPKIQPHSETRSKQLQAWQELISEYLKVTKQSTIDVRESQNSPLFNNTAINRKLSPEAVLTVLEDMAKSGKAAPIDKSKNVWEVYWHSLDEWGNMIYSWASSNGLNNTVCTLYELREGDNTVGEEFHGLDMNVLLKALKALSSNGKCELIEFDDNQGVKFF-