Monarch geneset OGS2.0

DPOGS210108
Transcript         DPOGS210108-TA  1272 bp
Protein            DPOGS210108-PA  423 aa
Genomic position   DPSCF300017  +  1195481-1217077
RNAseq coverage    555x (Rank: top 23%)
Annotation
Heliconius       HMEL005379        6e-128  90.11%
Bombyx           BGIBMGA000217-TA  2e-120  79.85%
Drosophila       CG34104-PB        4e-51   39.02%
EBI UniRef50     UniRef50_A7UV03   3e-55   44.09%  AGAP003931-PA n=1 Tax=Anopheles gambiae RepID=A7UV03_ANOGA
NCBI RefSeq      XP_970876.1       2e-52   52.28%  PREDICTED: similar to AGAP003932-PA [Tribolium castaneum]
NCBI nr blastp   gi|347970932      1e-54   44.09%  AGAP003931-PA [Anopheles gambiae str. PEST]
NCBI nr blastx   gi|347970932      8e-51   41.61%  AGAP003931-PA [Anopheles gambiae str. PEST]
Group
Gene Ontology    GO:0007264  3.1e-51  small GTPase mediated signal transduction
                 GO:0005622  3.1e-51  intracellular
                 GO:0005525  3.1e-51  GTP binding
                 GO:0016020  6.2e-17  membrane
                 GO:0007165  6.2e-17  signal transduction
                 GO:0006184  6.2e-17  GTP catabolic process
                 GO:0003924  6.2e-17  GTPase activity
                 GO:0015031  2.3e-12  protein transport
KEGG pathway 
InterPro domain  [238-408]  IPR003578  3.1e-51  Small GTPase superfamily, Rho type
                 [237-402]  IPR001806  5e-41    Small GTPase superfamily
                 [235-399]  IPR005225  1.7e-20  Small GTP-binding protein domain
                 [233-408]  IPR020849  6.2e-17  Small GTPase superfamily, Ras type
                 [236-408]  IPR003579  2.3e-12  Small GTPase superfamily, Rab type
Orthology group  MCL17915  Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210108-TA
ATGTCTCCTAGCTTAGAGGGTATCATGTTTAACAGGAACGCGAACCTGTCAGCTCGCCGGCCATTGCTCGCGGAACCACAGAAGCCAGTCGTGGTTCCACCTCGAGCACCACCCCGTGACTATGGTAAGTACGGTAGACCTGTGCCAGCGGCGAGGAACCCTTCAGCCGACGAAGGACGATTCGAATACAGATCCGATTATTGGAAACCGTCAAATGTTGACATCGAAAAACGGCTGCAGGAGACGAGAACACGAGAGAAGATCGGATACTTTCAAGACAAAGTGACCCCACCGGAATTAAACTTGGATCGTAGGGAAGCGGAATTAGTGTTTAAATTCGACCCTCACGCATCCGCTGAATATTTATCGAGTCAGTATCAAGCTCCAACAGTTCAATATGCAAAGCCCCCGACGCCGGCAAACGGGTTTGCTAACCGTCTACCGGAGAGAGATGGTCCTTTTGTATTCGGAGTGCACAGTCCCAGTCAGTTCCCTATACCGCGTCGGGACGAAGACGACTACGACTACTCCGAGGTGGCCGAGGAGCACGGCCGTCCAGTGATACGAGACGACGAATCTACGGAACTCAATTGTTGTGATAAAGTGAATGAGTGGACGGTTAGGGATAAGAAGAGTCGGAGGACTTTAAATCGTATGTTCCAAATAAAGGATCGACGGAAAGTTAAAGGTGGGAAGAAAGAAAAGATTAAGTGTGTGTTGGTGGGAGATGGAGCGGTGGGAAAGAGTTCCTTAATAGCTGCGTACGCCCAGGACACCTTTCGGGAAGAATATCAGCCGACCGCATACGACACATTTAATGTTGTGGTTGACGTTGATGACAGGCCGGTCTGTGTGGAAATCTGTGATACTGCGGGTCAGGACTCAATGTCCGAGCTCCGCGAGCTGTGCTATCCCGGTACCGATGTCCTGATGCTCTGTTTCTCCGTGGTTCGTCCGGAGACGTTCAAGTCAGTCGCCGATCGCTGGATCCGCGCCGTGTCCTCGGTGCAGGCTCCAGTAGTGCTCGTGGGGACGCAGAGCGACCTGGCCCTGGACGGACGTGTGATACAGACTTTACGGGCCCGCAACGAACACGCGGTAACAGAAGCTGAAGCGAGAGCATTGGCGGCAAAAATAAACGCCACGTACATAGAGACGTCAGCTAAGACACGGAAACAGCTGAAGGACGCCTTCGACGCCGCCATCTTGGCAGGGCTTCCAGTTGTACAGAACAAACGACCGCTATGGAAGAAATTATTATGCCTTAACTAG

Protein sequence:

>DPOGS210108-PA
MSPSLEGIMFNRNANLSARRPLLAEPQKPVVVPPRAPPRDYGKYGRPVPAARNPSADEGRFEYRSDYWKPSNVDIEKRLQETRTREKIGYFQDKVTPPELNLDRREAELVFKFDPHASAEYLSSQYQAPTVQYAKPPTPANGFANRLPERDGPFVFGVHSPSQFPIPRRDEDDYDYSEVAEEHGRPVIRDDESTELNCCDKVNEWTVRDKKSRRTLNRMFQIKDRRKVKGGKKEKIKCVLVGDGAVGKSSLIAAYAQDTFREEYQPTAYDTFNVVVDVDDRPVCVEICDTAGQDSMSELRELCYPGTDVLMLCFSVVRPETFKSVADRWIRAVSSVQAPVVLVGTQSDLALDGRVIQTLRARNEHAVTEAEARALAAKINATYIETSAKTRKQLKDAFDAAILAGLPVVQNKRPLWKKLLCLN-
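
Consistency check (illustrative):

The transcript and protein lengths above agree if the 1272 bp are read as 424 codons, the last one being a stop (shown as "-" in the protein sequence). The sketch below is one possible way to check this, assuming the standard genetic code; the helper name translate and the choice of "-" as stop symbol are illustrative and not part of the record. Only the first ten and last three codons of DPOGS210108-TA are used here.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons enumerated over T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon (hypothetical helper).

    Stop codons are written as stop_symbol, matching the trailing "-"
    used in the protein sequence of this record.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First 30 and last 9 nucleotides of DPOGS210108-TA, copied from the record.
first_codons = "ATGTCTCCTAGCTTAGAGGGTATCATGTTT"
last_codons = "CTTAACTAG"

print(translate(first_codons))  # MSPSLEGIMF
print(translate(last_codons))   # LN-
print(1272 // 3 - 1)            # 423 residues once the stop codon is excluded
```

Passing the full nucleotide sequence to the same function should reproduce the protein sequence listed above, provided the record uses the standard code throughout.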