Monarch geneset OGS2.0

DPOGS209916
Transcript: DPOGS209916-TA, 1242 bp
Protein: DPOGS209916-PA, 413 aa
Genomic position: DPSCF300519 + 28981-30222
RNAseq coverage: 54x (Rank: top 69%)
Annotation
Heliconius       HMEL021473        0.0   91.79%
Bombyx           BGIBMGA007058-TA  0.0   90.53%
Drosophila       CG32238-PA        0.0   77.70%
EBI UniRef50     UniRef50_B4MMY1   0.0   76.72%   GK16596 n=23 Tax=Eukaryota RepID=B4MMY1_DROWI
NCBI RefSeq      XP_001862924.1    0.0   78.08%   tubulin-tyrosine ligase [Culex quinquefasciatus]
NCBI nr blastp   gi|170053989      0.0   78.08%   tubulin-tyrosine ligase [Culex quinquefasciatus]
NCBI nr blastx   gi|170053989      0.0   78.08%   tubulin-tyrosine ligase [Culex quinquefasciatus]
Group
Gene Ontology    GO:0006464          1.2e-235   protein modification process
                 GO:0004835          1.2e-235   tubulin-tyrosine ligase activity
KEGG pathway
InterPro domain  [4-394] IPR004344   1.2e-235   Tubulin-tyrosine ligase
Orthology group  MCL10866            Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209916-TA
ATGGATAAAGGCCGTGTTACATATTGTACGGATTTAGAAAAATCTGTAATAATAACAAACTTTGAACGTAGAGGTTGGATTCAAGTGGGATCGGAGGAGGAGTGGAATTTTTACTGGTCATTCACACAAAACTGTCGCAATATTTTCAGTATTGAAAGTGGGTATAGAATGAACGATAACCAAATTATAAATCACTTTCCCAATCACTACGAGCTATCTCGTAAGGACTTACTGGTAAAAAACATAAAAAGGTACCGTAAGGAACTCGAACGAGAAGGTAATCCGTTAGCGGAGAAAAAGGAAGTGACGTTAGCGAATGGGCAAACTGTAACACGCTATATACACTTGGATTTTATCCCCGTCACTTACGTGCTGCCGGCCGATTACAATATGTTTGTTGAAGAATATCGCAAATCCCCGCAAAGCACTTGGATCATGAAACCATGCGGAAAATCTCAAGGAGCTGGGATTTTCCTAATAAATAAACTTTCCAAGTTAAAGAAATGGTCTCGAGAAGCGAAAACACCCTTACACCCTCAACTGGGTAGTAAAGAAAGTTATGTAATATCACGCTATATTGACAACCCTTTGTTGATCGGTGGGAAGAAGTTTGATCTCAGATTATATGTCTTAATAACTTCGTTTAGACCTTTAAAAGCATATTTATTCCAACATGGATTTTGCAGATTCTGTACGGTGAAGTATGACACTAGTGTAACGGAGTTGGACAATATGTACGTTCACTTGACAAATGTAAGTGTCCAAAAACACGGAGGTGACTACAATAGTTTGCATGGTGGTAAAATGAGTATTCAAAATTTCCGTCTTTATTTAGAAGGCACTCGAGGACGTTCAGTCACCGATAAACTGTTCGCTGATATGCAGTGGCTAATAGTCCATTCACTGAAAGCGGTTGCACCAGTAATGGCGAATGATCGTCATTGCTTTGAGTGTTACGGTTATGACATTATCATAGATAATGCACTCAAGCCCTGGCTAGTAGAGGTAAATGCATCACCCTCCCTACAATCAACAACTCATAGCGATAGAATATTGAAATACAAATTGATAGATAACATTGTATCCGTCGTGGTACCCCCGGGAGGAATTCCCGACGCCAGGTGGAATAAAATTCCAACCCCAGAAGCTTTAGGTGATTTCGATGTTTTGATTGATGAGGAGCTTATGGAAAAAGAGGATTCCGCTCCTCGAAATAGTAAAAGTAACCGATGCAAGCCTTAA

Protein sequence:

>DPOGS209916-PA
MDKGRVTYCTDLEKSVIITNFERRGWIQVGSEEEWNFYWSFTQNCRNIFSIESGYRMNDNQIINHFPNHYELSRKDLLVKNIKRYRKELEREGNPLAEKKEVTLANGQTVTRYIHLDFIPVTYVLPADYNMFVEEYRKSPQSTWIMKPCGKSQGAGIFLINKLSKLKKWSREAKTPLHPQLGSKESYVISRYIDNPLLIGGKKFDLRLYVLITSFRPLKAYLFQHGFCRFCTVKYDTSVTELDNMYVHLTNVSVQKHGGDYNSLHGGKMSIQNFRLYLEGTRGRSVTDKLFADMQWLIVHSLKAVAPVMANDRHCFECYGYDIIIDNALKPWLVEVNASPSLQSTTHSDRILKYKLIDNIVSVVVPPGGIPDARWNKIPTPEALGDFDVLIDEELMEKEDSAPRNSKSNRCKP-
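
The two sequences above are related by translation: the 1242 bp transcript is 414 codons, that is 413 amino acids plus a stop codon, which this record writes as a trailing "-". Below is a minimal sketch of that check, assuming the standard nuclear genetic code (NCBI translation table 1). The names CODON_TABLE and translate are illustrative and are not part of the geneset or its tooling.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as `stop_symbol` ("-" by default, matching the
    trailing character of the protein sequence in this record).
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        protein.append(stop_symbol if aa == "*" else aa)
    return "".join(protein)


# First 11 codons of DPOGS209916-TA.
print(translate("ATGGATAAAGGCCGTGTTACATATTGTACGGAT"))  # MDKGRVTYCTD
# Last 4 codons of DPOGS209916-TA, ending in the TAA stop codon.
print(translate("TGCAAGCCTTAA"))  # CKP-
```

Passing the full nucleotide sequence to translate should, under the same assumption, give a 414-character string whose last character is the stop symbol.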