Monarch geneset OGS2.0

DPOGS204353
Transcript: DPOGS204353-TA, 1245 bp
Protein: DPOGS204353-PA, 414 aa
Genomic position: DPSCF300142 + 402330-403574
RNAseq coverage: 22x (Rank: top 78%)
Annotation
Heliconius: HMEL021473, 1e-174, 68.66%
Bombyx: BGIBMGA007054-TA, 8e-175, 70.44%
Drosophila: CG32238-PA, 2e-166, 67.42%
EBI UniRef50: UniRef50_B4MMY1, 3e-165, 66.50%, GK16596 n=23 Tax=Eukaryota RepID=B4MMY1_DROWI
NCBI RefSeq: XP_002062551.1, 6e-166, 66.50%, GK16596 [Drosophila willistoni]
NCBI nr blastp: gi|195428998, 1e-164, 66.50%, GK16596 [Drosophila willistoni]
NCBI nr blastx: gi|156549431, 3e-163, 66.42%, PREDICTED: probable tubulin polyglutamylase TTLL1-like [Nasonia vitripennis]
Group
Gene Ontology: GO:0006464, 1.2e-207, protein modification process
               GO:0004835, 1.2e-207, tubulin-tyrosine ligase activity
KEGG pathway:
InterPro domain: [13-399] IPR004344, 1.2e-207, Tubulin-tyrosine ligase
Orthology group:
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204353-TA
ATGAATGATAAAGGAACAAAGCGAAATAAAAAGACATTAACTTTTTGTACAGACTTTGAGAAATCGATTGTTATTAACACTTTCTTGAGGCGACGTTGGAGGCAAGTCTCTCCGGATGAAGATTGGAATATTTATTGGTCAAACACTCTCAATTGTCGAAATATATTTAGCCATGACATTGGATATCGGCTTAAGGACAACCAGCTCATAAATCATTTTCCAAATCATTATGAATTGGTGCGTAAAGATTTGTTAGCTCGAAATATAAATCGATATAGGAAAGAATTAGAGAGAGCTGGCAATCCTATCGCTAAGAAAACACATAAAACACTTAACAATGGCCAAAAAATCACAAGATATGTGCATTTAGATTTTATTCCGACTACTTACATCTTGCCCTCGGACTATAAACTGTTCATCGAAGAATATCGCAGAAATCCCCAATACACGTGGATATTAAAACCTTGTGGAAAATCACAAGGAGCTGGCATTTTTATAATAAATAATTTATCTAAACTCAAAAAATGGGCTCGAGAATCCAAAAAATACTTTCAACATCATCTTTTAAGGAAAGACACGTATGTAATATCACGCTACATTCACAATCCGCTTTTGATTGGAGGAAAAAAATTTGATTTAAGGATATACGTCCTTGTAACATCATTCCGTCCTTTAAAAGCTTACATGTATAAACATGGATTTTGCAGAGTTTGTTCCTTAAAGTACAGAGATGCTGAACTTGAAAACATGTTTATTCATCTAACTAACGTGAGCGTTCAAAAACATGGAGAGGAATATAACTGTTATACCGGCGGCAAACTAAGTCTAAATAATTTAAAATTATATTTAGAAGGAACAAGAGGTCAAACGGTAACAAAGAGACTATTCGAAGATATTCAATGGTTAATAGTTCATTCTCTTAAGTCGGTGGCTTTAATTATGTCTAATGATCGCCATTGTTTTGAATGCTACGGATACGATATCATTATAGATAATAACCTAAAACCATGGCTTATAGAAGTTAATGCCTCTCCATCTATGACAGCCACAACAATTAATGACAGGATTCTTAAGTCTAATCTCATTGATAATATTTTATCCGTTGTTTTACCACCAAACGGAATTCCTGATGCGCGTTGGAACAAAGTACCCAGTGAAAATGCTTTAGGGGATTTCGAAAGACTTATAGATGAGGAATGCCTTGGCAAAGATGATATAGAAGATTCTATCTTGGAGATGTTGTGA

Protein sequence:

>DPOGS204353-PA
MNDKGTKRNKKTLTFCTDFEKSIVINTFLRRRWRQVSPDEDWNIYWSNTLNCRNIFSHDIGYRLKDNQLINHFPNHYELVRKDLLARNINRYRKELERAGNPIAKKTHKTLNNGQKITRYVHLDFIPTTYILPSDYKLFIEEYRRNPQYTWILKPCGKSQGAGIFIINNLSKLKKWARESKKYFQHHLLRKDTYVISRYIHNPLLIGGKKFDLRIYVLVTSFRPLKAYMYKHGFCRVCSLKYRDAELENMFIHLTNVSVQKHGEEYNCYTGGKLSLNNLKLYLEGTRGQTVTKRLFEDIQWLIVHSLKSVALIMSNDRHCFECYGYDIIIDNNLKPWLIEVNASPSMTATTINDRILKSNLIDNILSVVLPPNGIPDARWNKVPSENALGDFERLIDEECLGKDDIEDSILEML-
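
Consistency check (illustrative sketch, not part of the original record):

The lengths reported in this record can be cross-checked against each other. The Python sketch below assumes the genomic coordinates are 1-based and inclusive (an assumption, not stated in the record) and that the trailing "-" in the protein sequence marks the stop codon. The names `span_length`, `translate`, and `CODONS` are hypothetical helpers introduced here for illustration, and the codon table is deliberately partial: it holds only the codons found in the first 15 and last 12 nucleotides of DPOGS204353-TA.

```python
# Values copied from the record above.
START, END = 402330, 403574   # genomic position on DPSCF300142, + strand
TRANSCRIPT_BP = 1245          # reported transcript length
PROTEIN_AA = 414              # reported protein length

# Partial standard codon table: only the codons used in the snippets below.
# "-" stands for a stop codon, matching the notation in the protein sequence.
CODONS = {
    "ATG": "M", "AAT": "N", "GAT": "D", "AAA": "K",
    "GGA": "G", "GAG": "E", "TTG": "L", "TGA": "-",
}

# First 15 and last 12 nucleotides of DPOGS204353-TA, copied from the record.
HEAD_NT = "ATGAATGATAAAGGA"
TAIL_NT = "GAGATGTTGTGA"


def span_length(start: int, end: int) -> int:
    """Length of a genomic interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def translate(nt: str) -> str:
    """Translate an in-frame nucleotide string using the partial table above."""
    return "".join(CODONS[nt[i:i + 3]] for i in range(0, len(nt), 3))


if __name__ == "__main__":
    # Genomic span versus transcript length.
    print(span_length(START, END) == TRANSCRIPT_BP)
    # Codon count minus one stop codon versus protein length.
    print(TRANSCRIPT_BP // 3 - 1 == PROTEIN_AA)
    # Start and end of the translation versus the protein sequence.
    print(translate(HEAD_NT), translate(TAIL_NT))
```

Under these assumptions the genomic span equals the transcript length, which is consistent with a transcript that has no introns in this interval, and the 415 codons correspond to 414 residues plus the stop.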