Monarch geneset OGS2.0

DPOGS212324
Transcript: DPOGS212324-TA | 1023 bp
Protein: DPOGS212324-PA | 340 aa
Genomic position: DPSCF300019 - 859778-936861
RNAseq coverage: 95x (Rank: top 62%)
Annotation
Heliconius: HMEL013372 | 2e-31 | 55.81%
Bombyx: (no hit listed)
Drosophila: CG31381-PA | 4e-34 | 31.45%
EBI UniRef50: UniRef50_UPI00015B61B7 | 3e-46 | 36.36% | UPI00015B61B7 related cluster n=5 Tax=unknown RepID=UPI00015B61B7
NCBI RefSeq: XP_001604528.1 | 6e-47 | 36.36% | PREDICTED: hypothetical protein [Nasonia vitripennis]
NCBI nr blastp: gi|156555105 | 2e-38 | 37.15% | PREDICTED: tRNA dimethylallyltransferase, mitochondrial-like [Nasonia vitripennis]
NCBI nr blastx: gi|91092888 | 7e-39 | 35.65% | PREDICTED: similar to tRNA isopentenyltransferase 1 [Tribolium castaneum]
Group
Gene Ontology: GO:0005524 | 2.1e-72 | ATP binding
               GO:0008033 | 2.1e-72 | tRNA processing
               GO:0009058 | 6.1e-05 | biosynthetic process
               GO:0004161 | 6.1e-05 | dimethylallyltranstransferase activity
KEGG pathway: nvi:100120934 | 2e-46
              K00791 (E2.5.1.75, miaA, TRIT1) maps-> Zeatin biosynthesis
InterPro domain: [1-312] IPR002627 | 2.1e-72 | tRNA isopentenyltransferase
Orthology group: MCL10408 | Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212324-TA
ATGGTGGTAATCCTAGGAGCCACCGGAACAGGCAAAACTAAGTTAGGTTTGGAGTTGGCCCAGAGATTTGGAACAGAAATAATAAGTGCCGACTCGATGCAGGCAAGTGAGGCAACGCTCGTGTACCTGAGCTGTAGTACTATTGAATACAGTCACAGTCTCATTTGGCCGGAACCTGTTCGCGACGGATCAAGGATTCTAGAACAGAGAATTGGATTGAATTTATTTAGCAATTCGGGCTTTGTGACGGACTCTCAGAATATATACAAAGGACTGGATATAGTAACCGCCAAGGCGTCTCCTCAAGAACGGGAGCTGGTCAAACATCATCTGCTGGACATTCTGGAACCGCACCAGAACTTCACGGTGGTGGACTTTAGAAACCGAGCCCTTAGTATTATAGGTAATTTAACCGAACAAGGTAAAATACCCATAGTTGTAGGAGGTACGAACTACTATATAGAATCCATCGTGTATAATATATTGGTCGAAGACATGAACGATCCGGAGGCGTTGCTGTGGGATAAAAGTAAAAGGAAAAGAAACTTCGCGGAGGACTTTGATGAAATGCCAATAAAAAAAGCTGCGCTTGATCCGAGCGACGGTGCCGGAGACAGTTTAGTACCAGAATCTGATGGGGAAGTCAATTCGGATGTGAAGACGAATAGCAATAGTATTGACGTCAGTAAATTAAAAGAAGATGTCGACAATGAGAGGAAGTTCACTAATGAAGAGATTCATGAAAGATTAAAAGCCGTCGATCCGGTGATGGCTTCCAGGTTGCATCCGAATAATAGACGCAAAGTGTTAAGATCAATAGAGGTATGGTTGAAGACAGGAAGACGTCATAGCGACATCCTAGAAGAACAGAAGACGAGTGAAGGAAGACTGAGAAGACCAGGGTCTACTATTGTACTGTGGCTGAAGTGTGACCAGGTGTGTAAAAAACGTAAACAACGGGGTCACCTGCGCGCCGAGATTGGCAGGTGGTGCGCATGCGCATCTGCACTACGTTACGCATAA

Protein sequence:

>DPOGS212324-PA
MVVILGATGTGKTKLGLELAQRFGTEIISADSMQASEATLVYLSCSTIEYSHSLIWPEPVRDGSRILEQRIGLNLFSNSGFVTDSQNIYKGLDIVTAKASPQERELVKHHLLDILEPHQNFTVVDFRNRALSIIGNLTEQGKIPIVVGGTNYYIESIVYNILVEDMNDPEALLWDKSKRKRNFAEDFDEMPIKKAALDPSDGAGDSLVPESDGEVNSDVKTNSNSIDVSKLKEDVDNERKFTNEEIHERLKAVDPVMASRLHPNNRRKVLRSIEVWLKTGRRHSDILEEQKTSEGRLRRPGSTIVLWLKCDQVCKKRKQRGHLRAEIGRWCACASALRYA-