Monarch geneset OGS2.0

DPOGS202721
Transcript: DPOGS202721-TA; 1539 bp
Protein: DPOGS202721-PA; 512 aa
Genomic position: DPSCF300272 + 174000-179047
RNAseq coverage: 37x (Rank: top 73%)
Annotation
Heliconius: HMEL007888; 2e-86; 68.16%
Bombyx: BGIBMGA008432-TA; 8e-180; 67.71%
Drosophila: CG4089-PA; 7e-115; 48.59%
EBI UniRef50: UniRef50_D6WP53; 1e-132; 49.48%; Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WP53_TRICA
NCBI RefSeq: XP_001600788.1; 1e-134; 50.44%; PREDICTED: similar to conserved hypothetical protein [Nasonia vitripennis]
NCBI nr blastp: gi|91083483; 4e-132; 49.48%; PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum]
NCBI nr blastx: gi|91083483; 3e-132; 48.77%; PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum]
Group
Gene Ontology: GO:0006464; 2.7e-142; protein modification process
Gene Ontology: GO:0004835; 2.7e-142; tubulin-tyrosine ligase activity
KEGG pathway:
InterPro domain: [56-511] IPR004344; 2.7e-142; Tubulin-tyrosine ligase
Orthology group: MCL16501; Insect specific

Nucleotide sequence:

>DPOGS202721-TA
ATGACGAAGGCTCAGACTAAACAAATGTTGGATAAAAACGAGAAAATATCATATAATAATAACGAAAAAGATATCACAAACAATGACAAGCGACGTACGGATAAGGACACAAATAAAAATATATTTCTTTTAATATGTATAATAGGAGTATCTTTGGCATTTCTATTGGAAATATTAAATGTAAGGAATAGATATGAAAAGGTATGTAAGGACGACAAAAAACTTTACTGGGTTTATTCTGCCTACAGTGATGTTGAAAATAGAAAAGGTCTTTTGAAACATGTACATTTAGTATTAGAGAGAATTGGATATGAGAAAAGTACGAACAAAACGCCGTGGACGCTACTGTGGTCCCACGACTTCCCGTTCCGAGCACTCTATCCGAATCTGCACAGTTTGAAACCAAATCAGAAAGTCAATCACTTTCCTGGCACAGGATTCATAACAAACAAAGTCGATCTGGCGACTTCCGAGTCGAAATACATTCCGCGAGCGTTCAAGCTGCCGAGGAATAAAGCGGAGTTCGTTAAATACGCCAGTTTGAACAAAGATGCAATGTTTTTGGAAAAGCACAATCAGCATAGAGGTGTGTATTTGAAAAACGTGACTGCAATAGACTCCGACAGTGGCGAAAGTTTCGTGCAAGAGTTCATACAAAGACCGTTCCTCGTGGATGGACACAAGTTCGACATTGGAGTGTACGTTGTTCTGACATCCGTTAATCCATTAAGAGTTTATTGGTATAAAGGAGATGTGCTGTTCAGGTATTGTCCAGCGAAGTACTATCCATTTGATGCCAACAATCTTGATAAGTACGTCATCGGTGATGACTACCTTCCGACCTGGGAGGTGCCGTCCCTGGCACAGCCTTATGCACTCGGATACTCAATGAAAGACGCTTTCGATCATTACGCCAAAACCAAGGGTTTAGACACAACTCGCATGTGGAAGGATGTCCAGGAAGCAATCACCGAAGTTTTTATTAAAAAAGAACATCACATAGTAGAGGCTTTAAAGAATTACCCATCGCAGGACAACTTCTTTGAAATGATGCGTTTTGATCTAGTTGTAGATGAGAATCTCAAGGTCTATCTGCTGGAAGCCAACATGTCCCCCAACCTTAGCTCGGCACATTTCCCGCCGAACCAACTTTTGTATGAGCAGGTTCTATACAACCTATTCTCTCTGGTCGGCGTCGCTTCTCACAGTGATATTACTGACAATAATGTGCGAAATATGATGTCGTCACAAAAGAACATAGCTGTATACAGCGAAGAATGCAATTCTATTTGTAGGGATGATTGTGCGGTATTGGACATTTGCAAACTTTGCAGACCCTGTCTAAGCACTAGATTAAGGTCTAACCTATTAAATGCACATAGGGAAAATTTACATCAAGGAGATTTTAGGAGACTGTTTCCTCCGGCTATGGAACCTCGGGAGAGTGCGGGAAATGTAACGAAAATTCTGAACGAAGCGAACAGGCTGCAATATTTATGGTATCAAGGAAAATGTAACGATGACGTCAGTTGGTTATAA

Protein sequence:

>DPOGS202721-PA
MTKAQTKQMLDKNEKISYNNNEKDITNNDKRRTDKDTNKNIFLLICIIGVSLAFLLEILNVRNRYEKVCKDDKKLYWVYSAYSDVENRKGLLKHVHLVLERIGYEKSTNKTPWTLLWSHDFPFRALYPNLHSLKPNQKVNHFPGTGFITNKVDLATSESKYIPRAFKLPRNKAEFVKYASLNKDAMFLEKHNQHRGVYLKNVTAIDSDSGESFVQEFIQRPFLVDGHKFDIGVYVVLTSVNPLRVYWYKGDVLFRYCPAKYYPFDANNLDKYVIGDDYLPTWEVPSLAQPYALGYSMKDAFDHYAKTKGLDTTRMWKDVQEAITEVFIKKEHHIVEALKNYPSQDNFFEMMRFDLVVDENLKVYLLEANMSPNLSSAHFPPNQLLYEQVLYNLFSLVGVASHSDITDNNVRNMMSSQKNIAVYSEECNSICRDDCAVLDICKLCRPCLSTRLRSNLLNAHRENLHQGDFRRLFPPAMEPRESAGNVTKILNEANRLQYLWYQGKCNDDVSWL-
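
Consistency check (illustrative sketch):

The transcript and protein entries above can be cross-checked by translating the coding sequence. The sketch below is not part of the original record. It assumes the standard genetic code and that the trailing "-" in the protein sequence marks the stop codon. The function name translate and the stop_symbol parameter are hypothetical, and only short excerpts from the start and end of DPOGS202721-TA are used.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G
# (first base varies slowest, third base fastest); "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate an in-frame coding sequence, writing stops as stop_symbol."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        protein.append(stop_symbol if aa == "*" else aa)
    return "".join(protein)


# First 30 nt and last 15 nt of DPOGS202721-TA, copied from the record above.
first_codons = translate("ATGACGAAGGCTCAGACTAAACAAATGTTG")
last_codons = translate("GTCAGTTGGTTATAA")

# 512 residues plus one stop codon should account for all 1539 bp.
lengths_agree = 1539 == 3 * (512 + 1)
```

Running translate on the full nucleotide sequence and comparing the result with DPOGS202721-PA extends the same check to every codon.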