Monarch geneset OGS2.0

DPOGS200728
TranscriptDPOGS200728-TA1566 bp
ProteinDPOGS200728-PA521 aa
Genomic positionDPSCF300030 - 108776-117536
RNAseq coverage1735x (Rank: top 7%)
Annotation
HeliconiusHMEL0089560.078.25% 
BombyxBGIBMGA001126-TA0.081.96% 
DrosophilaRep-PA2e-8736.20% 
EBI UniRef50UniRef50_E1ZY694e-13946.43%Rab proteins geranylgeranyltransferase component A 2 n=9 Tax=Formicidae RepID=E1ZY69_CAMFO
NCBI RefSeqXP_001950164.13e-14247.39%PREDICTED: similar to Choroideremia [Acyrthosiphon pisum]
NCBI nr blastpgi|1935799327e-14147.39%PREDICTED: rab proteins geranylgeranyltransferase component A 1-like [Acyrthosiphon pisum]
NCBI nr blastxgi|1935799323e-13747.39%PREDICTED: rab proteins geranylgeranyltransferase component A 1-like [Acyrthosiphon pisum]
Group
KEGG pathwaypic:PICST_904442e-27 
 K00680 (E2.3.1.-)maps-> Benzoate degradation via CoA ligation
    Limonene and pinene degradation
    Ethylbenzene degradation
    Tyrosine metabolism
    1- and 2-Methylnaphthalene degradation
InterPro domain[1-522] IPR0166642.6e-155Rab protein geranylgeranyltransferase component A, eukaryota
[6-485] IPR0182038.9e-80GDP dissociation inhibitor
Orthology groupMCL10995 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200728-TA
ATGGACGACGATTTACCTACAGACTTCCAGGTCATCGTCGTGGGGACGGGCATGGTTGAGTCCATCGTAGCTGCAGCATGCAGTCGAATAGGCAAAAATGTGCTACATCTAGACTCGAGTGACCACTATGGGGGTCTTTGGGCCTCATATAATTTTGAGGGTCTTCAGAAATTTATTAAGGAAATTAACTCGGACCCAAACAGGCAACTCCAGGTGTACAACTTGATTGAAAAGTGGTATATTGATAAAGACTCACCTCAAGAGGAAACGAAACAAGAGACTGAAGATGAAAAGACTGAGCCCCCCAAGAAGATATGGAGCCAAGCCGACTTTGCTTCCGAGTACAGGAAGTTTAATATTGACATGACACCAAAGCTGCTGTTTTCCCGGGGGCCGTTAGTGGAGCTCCTAATATCTTCGAATATTGCTCGTTATGCTGAGTTCCGATGCGTGACACGTGTTCTCACTTGGCTCAATGACAAGTTGAATCCTGTCCCCTGTTCCCGGGCTGACGTGTTCGCTACGGAGGCTGTCAGCATCGTGGAGAAGAGGATGCTCATGAAAATGCTCACTTCCATCGTAGGGTACAATGAAGAAGAGATGGACAATGAATTTAAGGATTGGACCGACAAATCCTTCAAGGACTACCTGACTCACAAGGGTCTGACACCGAATCTGATCCACTACGTGTTGTACGCTATCGCCGGCGGTTCTGACGCTATGCCGTGTCTGGAGGGTGTTAGGGAATGCAAGAAGTTCCTGATGAGTCTCGGCCGTTATGGGAACACGCCGTTCCTTTGGCCGATGTACGGCAGCGGGGAACTGCCTCAGGGCTTCTGTCGACTATGCGCGGTATTCGGCGGCGTGTATTGCTTGAATCGTCCGATAGATTCGGTCGAGACTAAGACGGGTGACGAAGGGAAAGAGATCGTGGTCATCGGCAGCAAGGCCAAGAACTTGAATTGCGATCATCTAGTCATCGGTATAAACGAGTGTCCCAAGGATCTACTGTCCTCGGAGCCGAGCGAGAGCTCTGATATATCCAAGGCTATATTCATTACCAATGGCACTATAATGCCAAGCGAGAAAGAACCGCTGACCCTACTGAGATTCCCGCCGCTAGATGAAGGTGACAACCCTGTTACTGTTCTCGAAGTCGGACCAGCCACCGGCTCCTGCCCTAAAGGCCTCTTCGCCGTTTACTTCATAACGAACAAGGTTAAGGATGCCGAGAGCGATCTCATGAAGTACGCGGAGAAGATCTTCGACATGACTGGAGACCAGACCAAGGCTGGAGACAAGCCGACGTGCCTGTGGTCTCTGTTCTACAACGTGAAGGATGTGAGCGCGGGAGTACGTGACGGCGTTGAGACGGTCCACGTGTGTGCCGGACCAGACGCCGGGCTGGACTTCGACCGCGCCGTACTCCAGGCGGAACAAATCTTCAAGAAGATCTGCCCGGGCGAGGAGTTCTTGCCCCGCGCCCCGGATCCTGAAGACATCGTCTTCGAGGATGACGTCACTCACGGGCCGGAGTTCCGCGGCGACGAAGGAGACAAAGAGTAA

Protein sequence:

>DPOGS200728-PA
MDDDLPTDFQVIVVGTGMVESIVAAACSRIGKNVLHLDSSDHYGGLWASYNFEGLQKFIKEINSDPNRQLQVYNLIEKWYIDKDSPQEETKQETEDEKTEPPKKIWSQADFASEYRKFNIDMTPKLLFSRGPLVELLISSNIARYAEFRCVTRVLTWLNDKLNPVPCSRADVFATEAVSIVEKRMLMKMLTSIVGYNEEEMDNEFKDWTDKSFKDYLTHKGLTPNLIHYVLYAIAGGSDAMPCLEGVRECKKFLMSLGRYGNTPFLWPMYGSGELPQGFCRLCAVFGGVYCLNRPIDSVETKTGDEGKEIVVIGSKAKNLNCDHLVIGINECPKDLLSSEPSESSDISKAIFITNGTIMPSEKEPLTLLRFPPLDEGDNPVTVLEVGPATGSCPKGLFAVYFITNKVKDAESDLMKYAEKIFDMTGDQTKAGDKPTCLWSLFYNVKDVSAGVRDGVETVHVCAGPDAGLDFDRAVLQAEQIFKKICPGEEFLPRAPDPEDIVFEDDVTHGPEFRGDEGDKE-