Monarch geneset OGS2.0

DPOGS207567
Transcript | DPOGS207567-TA | 1725 bp
Protein | DPOGS207567-PA | 574 aa
Genomic position | DPSCF300072 - 648157-660068
RNAseq coverage | 51x (Rank: top 70%)
Annotation
Heliconius | HMEL017151 | 0.0 | 87.29%
Bombyx | BGIBMGA004723-TA | 2e-110 | 88.89%
Drosophila | CG31108-PA | 2e-48 | 36.01%
EBI UniRef50 | UniRef50_UPI00021A7BAA | 4e-136 | 45.75% | UPI00021A7BAA related cluster n=3 Tax=unknown RepID=UPI00021A7BAA
NCBI RefSeq | XP_001812872.1 | 1e-139 | 45.82% | PREDICTED: similar to predicted protein [Tribolium castaneum]
NCBI nr blastp | gi|383863180 | 2e-138 | 46.37% | PREDICTED: probable tubulin polyglutamylase TTLL2-like [Megachile rotundata]
NCBI nr blastx | gi|270007898 | 9e-139 | 46.27% | hypothetical protein TcasGA2_TC014642 [Tribolium castaneum]
Group
Gene Ontology | GO:0006464 | 7.9e-126 | protein modification process
Gene Ontology | GO:0004835 | 7.9e-126 | tubulin-tyrosine ligase activity
KEGG pathway
InterPro domain | [13-346] | IPR004344 | 7.9e-126 | Tubulin-tyrosine ligase
Orthology group | MCL16937 | Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207567-TA
ATGTTAGAGATGGCTCAAGTATTTGATAGTGATGGTCCATTTATTTATCGCTTAAATGATAACGGTTCCGGACCCAGTCTTCTAGTTAAGATATTTACTGAACGTGGTTGGCGAATATATCAGGGCTACAATGGTGCAGAGGAGAGATGGAATCTTTGGTGGAGGACAAGCGCTTTTCCAGCTACTTCCTACAAAGCCTTGGGAGACTGGCAGTTTATGAACCACATACCGAAAGGCGGTTCAATTTGTCGGAAGGACAGCTTGTCGAGATTGCTCCGATGTATGAGAAGAATATACGGATCTATTTATGACTTCAGCCCGCCTTGTTTCCACCTACCTCTGGAGTACGCAAAACTGGTTTCCGAGTGCTCAAGACTCAGGCGCGGCGATGATCCCACCAGCTCGAACGTGGTGTGGATACACAAACCAGTGGCGCAGAGTCAAGGACGGGGGATCTTCCTTTTTAGGAGTGTCTGTGACATGAGATGCGGTAGTCCCGCCGTAGTACAGAGGTATATCGAGCGCCCTCTATTGATAGCGGGATACAAGTTTGATCTGAGATTGTACGTATGTGTTCCCGGTTATCGTCCGTTAACGGCTTATATGTACGCAGAGGGATTGGCGAGGTTTGGAACGGATAAGTACACTTTATCTGACATCCACAATCCCTACCGACATCTTACAAATTCATCTCTTAACAAAACTGGGCCGCGGTACGCAGAGTGCAAGGATAGAATAGGCAGTGGTTGTAAGTGGACGCTGAAGCAAGTTAGGCGAGCACTGGTCGGAAGATGGGGTGCTGTGGAATGGTTGGTTTGGCAAAGAATACGAGCGTTGGTTACACTGACTTTACTCGCTCAAGCTGCTGGAACTCCACCGGCAAGGAATTGTTTCGAGTTTTACGGGTTCGATGTACTACTTGATGATTGTTTGAAGCCTTGGCTTATTGAGGTAAATTTATCACCCGCTTTGGCTGCTGACTGTGAAGCTGATGTGACAGTGAAGCAGCCAATGTTACACGAGCTATTTGACCTTTTGGGTTTACCTATGCGTCACACGGGGTTGTCCTTGCTACAGGGGCCACCGACTCCACATATAAGCTGCAGTTCTGAAGAGGAGAATAGTTCTGCTAAGACTGTTGGACGAGGACCGACTGGGCCGCGTATAGGGCGACGAGTGAGAACACGAAAAAGACGAGCCATGCCCTTACACTGCGTTACTCTACAAGCTCCAATGACCGAAACTATTACAGATTTAGCGGAAAAAACAAAAGATATACAAATGGGGCGATTATACAAAACGACGTCAGATGTGAGCCCGGAGAGTTCAGAATCACAATGTTCGACGACAGTAATTCCCTCAACGGAGCCTATAGATGAATCTTGGCGTGGTGGGTATTCAACGGCTGCTTCTCGACGTCGGATGATGGCCGCTTGCTCGTGGGGTAACGGTGTCCGATGGGACCGTGGGGTCGGACGAGTGGGTCACTGGGTCCGGATATATCCCCACTCGCTGCCAGCTGATGATCAGCTACCAGGCGAAGACGTCAGGGAGAGTGTAGCTCAAGTGTCTAAATTTTTGAGAGCAGCACGTGAAGTCGCGAAAGACGGAGGCAGAGACAAAAAAGCATCTCGAGATGGAACAAGAGATTCGATTTTTGAAGTCACCCTCCGTAAAAAACTCGATTATGACCATAACTTTGAAGTCTGGCTGCCACCTTTCTAG

Protein sequence:

>DPOGS207567-PA
MLEMAQVFDSDGPFIYRLNDNGSGPSLLVKIFTERGWRIYQGYNGAEERWNLWWRTSAFPATSYKALGDWQFMNHIPKGGSICRKDSLSRLLRCMRRIYGSIYDFSPPCFHLPLEYAKLVSECSRLRRGDDPTSSNVVWIHKPVAQSQGRGIFLFRSVCDMRCGSPAVVQRYIERPLLIAGYKFDLRLYVCVPGYRPLTAYMYAEGLARFGTDKYTLSDIHNPYRHLTNSSLNKTGPRYAECKDRIGSGCKWTLKQVRRALVGRWGAVEWLVWQRIRALVTLTLLAQAAGTPPARNCFEFYGFDVLLDDCLKPWLIEVNLSPALAADCEADVTVKQPMLHELFDLLGLPMRHTGLSLLQGPPTPHISCSSEEENSSAKTVGRGPTGPRIGRRVRTRKRRAMPLHCVTLQAPMTETITDLAEKTKDIQMGRLYKTTSDVSPESSESQCSTTVIPSTEPIDESWRGGYSTAASRRRMMAACSWGNGVRWDRGVGRVGHWVRIYPHSLPADDQLPGEDVRESVAQVSKFLRAAREVAKDGGRDKKASRDGTRDSIFEVTLRKKLDYDHNFEVWLPPF-
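
Length check: the transcript and protein lengths listed in this record (1725 bp, 574 aa) can be cross-checked with a short sketch. It assumes the transcript is coding sequence only, from the ATG start codon through a terminal stop codon (the sequence above ends in TAG, shown as "-" in the protein). The function name `expected_protein_length` is hypothetical and not part of any database tooling.

```python
def expected_protein_length(cds_bp: int) -> int:
    """Number of residues encoded by a complete CDS of cds_bp nucleotides.

    Assumes the CDS runs from the start codon through one terminal stop
    codon, so the stop codon is subtracted from the codon count.
    """
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


# Lengths as listed in the record: 1725 bp transcript, 574 aa protein.
print(expected_protein_length(1725) == 574)
```

Under that assumption, 1725 bp corresponds to 575 codons, that is 574 residues plus the stop codon, which agrees with the listed protein length.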