Monarch geneset OGS2.0

DPOGS204401
TranscriptDPOGS204401-TA627 bp
ProteinDPOGS204401-PA208 aa
Genomic positionDPSCF300002 - 1055214-1055840
RNAseq coverage1762x (Rank: top 7%)
Annotation
HeliconiusHMEL0156832e-12099.04% 
BombyxBGIBMGA007712-TA2e-11998.08% 
DrosophilaRab7-PA2e-9879.81% 
EBI UniRef50UniRef50_P511499e-10284.13%Ras-related protein Rab-7a n=243 Tax=root RepID=RAB7A_HUMAN
NCBI RefSeqNP_001040368.15e-11898.08%Rab7 [Bombyx mori]
NCBI nr blastpgi|1140513688e-11798.08%Rab7 [Bombyx mori]
NCBI nr blastxgi|1140513683e-11498.08%Rab7 [Bombyx mori]
Group
Gene OntologyGO:00072646.5e-95small GTPase mediated signal transduction
GO:00055256.5e-95GTP binding
GO:00150316.5e-95protein transport
GO:00160202.3e-30membrane
GO:00071652.3e-30signal transduction
GO:00061842.3e-30GTP catabolic process
GO:00039242.3e-30GTPase activity
GO:00068861.3e-13intracellular protein transport
GO:00069131.3e-13nucleocytoplasmic transport
GO:00056223e-12intracellular
KEGG pathwaycqu:CpipJ_CPIJ0090891e-107 
 K07897 (RAB7A)maps-> Amoebiasis
    Endocytosis
    Phagosome
InterPro domain[9-176] IPR0035796.5e-95Small GTPase superfamily, Rab type
[10-174] IPR0018066.1e-58Small GTPase superfamily
[6-176] IPR0208492.3e-30Small GTPase superfamily, Ras type
[10-169] IPR0052253.8e-26Small GTP-binding protein domain
[14-206] IPR0020411.3e-13Ran GTPase
[11-176] IPR0035783e-12Small GTPase superfamily, Rho type
Orthology groupMCL14862 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204401-TA
ATGTCTTCAAGAAAAAAGGTTCTTTTAAAAGTTATTATCCTAGGAGATAGCGGTGTCGGTAAAACATCCTTGATGAATCAGTTTGTCAACAAGAAATTTTCCAATCAGTACAAGGCTACAATAGGTGCTGATTTCCTCACGAAGGAAGTAATAGTTGACGATAGAATCGTCACTATGCAAATTTGGGATACGGCAGGGCAAGAGAGATTCCAATCTTTGGGGGTGGCGTTCTACCGTGGGGCGGACTGTTGCGTCTTAGTTTTTGACGTAACTGCCCCTAACACGTTTAAGTCCTTGGAAAGTTGGCGAGATGAATTTCTAATACAGGCTTCACCTCGAGATCCTGACAATTTCCCGTTTGTTATACTAGGTAATAAGGTCGATTTGGAGAACCGTGCTGTTTCTGCTAAAAGAGCCCAACAGTGGTGTCAAAGCAAAAATGATATCCCATACTTCGAAACAAGTGCCAAAGAAGCAGTTAACGTTGAGCTCGCATTCCAAACTATCGCACGTAATGCTTTAGCTCAAGAGACTGAAGCTGAGCTGTATAATGAATTCCCAGATCAAATCAAGCTAAACGCTAATGACAACGGCCGTAACAGAGATGGAGATAACTGTGCTTGCTAA

Protein sequence:

>DPOGS204401-PA
MSSRKKVLLKVIILGDSGVGKTSLMNQFVNKKFSNQYKATIGADFLTKEVIVDDRIVTMQIWDTAGQERFQSLGVAFYRGADCCVLVFDVTAPNTFKSLESWRDEFLIQASPRDPDNFPFVILGNKVDLENRAVSAKRAQQWCQSKNDIPYFETSAKEAVNVELAFQTIARNALAQETEAELYNEFPDQIKLNANDNGRNRDGDNCAC-