Monarch geneset OGS2.0

DPOGS210429
TranscriptDPOGS210429-TA669 bp
ProteinDPOGS210429-PA222 aa
Genomic positionDPSCF300062 - 358776-362204
RNAseq coverage312x (Rank: top 36%)
Annotation
HeliconiusHMEL0134159e-7699.25% 
BombyxBGIBMGA012644-TA2e-5387.90% 
DrosophilaRab23-PA5e-7560.83% 
EBI UniRef50UniRef50_Q9ULC32e-6856.12%Ras-related protein Rab-23 n=78 Tax=Eukaryota RepID=RAB23_HUMAN
NCBI RefSeqXP_001602379.16e-8768.20%PREDICTED: similar to rab23 [Nasonia vitripennis]
NCBI nr blastpgi|3071674701e-8666.25%Ras-related protein Rab-23 [Camponotus floridanus]
NCBI nr blastxgi|910873498e-9376.07%PREDICTED: similar to GA15247-PA [Tribolium castaneum]
Group
Gene OntologyGO:00072641.4e-57small GTPase mediated signal transduction
GO:00055251.4e-57GTP binding
GO:00150311.4e-57protein transport
GO:00160201.8e-24membrane
GO:00071651.8e-24signal transduction
GO:00061841.8e-24GTP catabolic process
GO:00039241.8e-24GTPase activity
GO:00056222.8e-12intracellular
GO:00068861.1e-05intracellular protein transport
GO:00069131.1e-05nucleocytoplasmic transport
KEGG pathwaynvi:1001184062e-86 
 K06234 (RAB23)maps-> Hedgehog signaling pathway
InterPro domain[1-217] IPR0155931.5e-114Small GTPase superfamily, Rab23
[10-172] IPR0035791.4e-57Small GTPase superfamily, Rab type
[11-169] IPR0018061.7e-48Small GTPase superfamily
[11-162] IPR0052251.7e-27Small GTP-binding protein domain
[7-172] IPR0208491.8e-24Small GTPase superfamily, Ras type
[12-172] IPR0035782.8e-12Small GTPase superfamily, Rho type
Orthology groupMCL12373 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210429-TA
ATGCGGGAAGAAGAGCTGGAGGTAGCTCTGAAGGTTGTGATCGTCGGCGACGGTGGTGTTGGCAAATCGAGTATGATCCAACGTTACTGTCGAGGCACCTTCACCAGGGATTACAAGAAAACCATCGGGGTAGACTTTCTTGAGCGGCAGATCGATATCGATGGAGAGGAGGTGAGACTGATGCTGTGGGACACGGCTGGTCAGGAGGAGTTCGACGCTATAACTAAAGCGTATTACAGAGGAGCCCACGCCTGTGTGCTGGCTTTCTCGACAACAGACAGAGACTCCTTTTTAGCTTTGCATTCATGGAAACTTAAGGTCGAGAATGAATGCGGAGAAATTCCTACAATTATAGTCCAGAATAAAATTGACCTGATGGATCAATGTGTGGTCGGACCTGACGAGGCGGAGTTAGTGGCCCGGGCGCTGGGTTGCCGCCTAATGCGTGCGTCTGTCAAAGAGGACGTGAACGTTGGCGCAGTATTTCGAGCGCTGGCCTCGCGCTGCCTCGCCGACCTGAGAGCTGACAGTGAACCGCCCCCAACAGCAACACCACTTATAGGTACATTTACAAACCATAACAGCAACGGAACTATCTTACTGAGACCTACGAAGAACCGTTCCGGCAAGAAGAAGAACGTACTCAAGAGCGCTTGCCGTATATTGTAA

Protein sequence:

>DPOGS210429-PA
MREEELEVALKVVIVGDGGVGKSSMIQRYCRGTFTRDYKKTIGVDFLERQIDIDGEEVRLMLWDTAGQEEFDAITKAYYRGAHACVLAFSTTDRDSFLALHSWKLKVENECGEIPTIIVQNKIDLMDQCVVGPDEAELVARALGCRLMRASVKEDVNVGAVFRALASRCLADLRADSEPPPTATPLIGTFTNHNSNGTILLRPTKNRSGKKKNVLKSACRIL-