Monarch geneset OGS2.0

DPOGS202340
TranscriptDPOGS202340-TA1833 bp
ProteinDPOGS202340-PA610 aa
Genomic positionDPSCF300032 + 877880-879712
RNAseq coverage120x (Rank: top 57%)
Annotation
HeliconiusHMEL0100380.064.90% 
BombyxBGIBMGA004837-TA0.070.42% 
DrosophilaCG1550-PA0.048.86% 
EBI UniRef50UniRef50_UPI00022CA4610.054.20%UPI00022CA461 related cluster n=2 Tax=unknown RepID=UPI00022CA461
NCBI RefSeqXP_394564.10.056.01%PREDICTED: similar to CG1550-PA [Apis mellifera]
NCBI nr blastpgi|3800260050.056.18%PREDICTED: tubulin--tyrosine ligase-like protein 12-like [Apis florea]
NCBI nr blastxgi|3838543040.056.32%PREDICTED: tubulin--tyrosine ligase-like protein 12-like [Megachile rotundata]
Group
Gene OntologyGO:00064645e-192protein modification process
GO:00048355e-192tubulin-tyrosine ligase activity
KEGG pathway 
InterPro domain[1-597] IPR0043445e-192Tubulin-tyrosine ligase
Orthology groupMCL14980 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202340-TA
ATGGATGGCATATCGAGTTATAACTTATTCATAGCAGCACATAAGCCGCAACTCGTTCTGTCCGGAGTCCCGGAACATTTTTGGCCGGTTTTATGTAAGAAACTTAAAGATCAAATATTTGATTCGGGCTTATCCTTTCAGTTAGTTAAAATAGATTACGAGGACGATGAAAAAGAATCTTATGACCCGTTGTGGAGTGTTATGGCTATCAGTGACTTAGATCCTACAGATTCCAGTAATATTTTCCTAGTAGATCATGCATGGACTTTCAAGGCTAACACTATTAAAAGTAGTCTTAGAAATATGCCGGCCTTATTAGAAAGGATGTGTAATCTTATGCAAATTGTTAGAGAAACAACTGAGGAACAAATAGAGGAAGTGTATCAAAATGTATGGAAATATGCAAATACTTATTCTGTTGGAGGAGACGAGCTATCAGTTGAAGACAGAGTACCAGTGTGGTATTTAATGGATGAGTTGGGTTCAGCTATCAATCATTCTGATCAGCCAAATTTCAGAGCTGTACCATTTATATATATACCCGAGCAAATCACATATACTTTACTTTTCCCAATAGAAAATGTTGAAGAAGGTGATTTGATAACAAGAAACTTCATAGAGGGCAAATTTTCAGATCCAAAACAAAGAGAAGCCATGCTGATTCCATGGAAGTATTATGAACATTTTGATGACAATTTCAGTCAAACCGAACCTGATATTAACTATTTTTTAGAGGGGCATATTGTTGAGTCACTACCTGATTTAGATAATCTGCAAAGCAAAGTAAATGTATCAGGAAAGCTTAAAGTTTTTTCAGAATATGAATATATAAATCAGTATCTGACAGCAGATGAGTTTCAAATAGTTGACAGTGAAAGTGAAGCAGATATATTATGGTATATAGAACACTATAAAACTTTTAAGGAATTAAGTGTTAACTCCCCCAACAAGTTTGTCAACCAATTCCCTTTTGAATATGTTGTTACAATAAAAGATATTTTGGCTATCATAGCAAGGAGAAACAATAAAATCTATGGGAATGCAGAACTTGAAACGTTTCCCACTTGGTTACCAACAACTTTTAACATGAAGACTGAGCTTTCGAAGTTAGTGGCTTATTACATGCAGAGAAAAAAACAAGGTCTGGATAATCACTGGATTTGTAAACCTTACAATCTTGCGAGGGGTTTAGATACATATATTACTGATAACTTAGATTTTCTATGCAGGCTTCCACTGTCAGGCCCCAAGATTGCTCAAAAATATATTGAAAATCCTGTCCTATTTGACAGAGCTGATATCGGGAAGGTTAAATTTGATATTCGTTATGTTGTTCTTCTCAAATCTGTTAATCCAACTGAAGTTTTTGTGTACAATAATTTCTTCTTACGGCTGTCTAATAAAGAATTTTCAATGGACAACTTTGATGATTATGAAAAACATTTTACTGTAATGAATTACACTGAAGGAGCGCCGTTGTTTAAATTATTATGTGAAGATTTTAAAAATGCCTGGTCTAACCAGTACCCAGATTACGAATGGGATGAAGTAGAGCGGTCTATATTTAATATGCTGTCTGAATTGTTTACATGTGCTACTGCCAAAGAACCCCCTCTTGGAATAGCGAGAAGTCCCCAATCAAGAGCTTTGTATGCAGTCGATGTGATGTTGAGTTGGGATAAAAATGGTACTGTCATACATCCAAAGCTACTAGAATTAAATTGGATGCCTGATTGTCGAAGAGCTTGTGAATATTATCCTGACTTTTACAATGACATATTCTCTGTGCTATTTTTAGACAAAACTGTTGGGTCTTGTACCAAGATCATGTAA

Protein sequence:

>DPOGS202340-PA
MDGISSYNLFIAAHKPQLVLSGVPEHFWPVLCKKLKDQIFDSGLSFQLVKIDYEDDEKESYDPLWSVMAISDLDPTDSSNIFLVDHAWTFKANTIKSSLRNMPALLERMCNLMQIVRETTEEQIEEVYQNVWKYANTYSVGGDELSVEDRVPVWYLMDELGSAINHSDQPNFRAVPFIYIPEQITYTLLFPIENVEEGDLITRNFIEGKFSDPKQREAMLIPWKYYEHFDDNFSQTEPDINYFLEGHIVESLPDLDNLQSKVNVSGKLKVFSEYEYINQYLTADEFQIVDSESEADILWYIEHYKTFKELSVNSPNKFVNQFPFEYVVTIKDILAIIARRNNKIYGNAELETFPTWLPTTFNMKTELSKLVAYYMQRKKQGLDNHWICKPYNLARGLDTYITDNLDFLCRLPLSGPKIAQKYIENPVLFDRADIGKVKFDIRYVVLLKSVNPTEVFVYNNFFLRLSNKEFSMDNFDDYEKHFTVMNYTEGAPLFKLLCEDFKNAWSNQYPDYEWDEVERSIFNMLSELFTCATAKEPPLGIARSPQSRALYAVDVMLSWDKNGTVIHPKLLELNWMPDCRRACEYYPDFYNDIFSVLFLDKTVGSCTKIM-