Monarch geneset OGS2.0

DPOGS202506
TranscriptDPOGS202506-TA1971 bp
ProteinDPOGS202506-PA656 aa
Genomic positionDPSCF300131 - 345883-351820
RNAseq coverage197x (Rank: top 47%)
Annotation
HeliconiusHMEL0074381e-17379.25% 
BombyxBGIBMGA001426-TA0.078.28% 
DrosophilaCG5987-PA6e-14148.79% 
EBI UniRef50UniRef50_UPI00021A652D0.054.84%UPI00021A652D related cluster n=3 Tax=unknown RepID=UPI00021A652D
NCBI RefSeqXP_975348.13e-17352.92%PREDICTED: similar to tubulin tyrosine ligase-like family, member 6 [Tribolium castaneum]
NCBI nr blastpgi|3838521410.055.77%PREDICTED: uncharacterized protein LOC100876864 [Megachile rotundata]
NCBI nr blastxgi|3838521410.055.65%PREDICTED: uncharacterized protein LOC100876864 [Megachile rotundata]
Group
Gene OntologyGO:00064642.8e-191protein modification process
GO:00048352.8e-191tubulin-tyrosine ligase activity
KEGG pathway 
InterPro domain[88-532] IPR0043442.8e-191Tubulin-tyrosine ligase
Orthology groupMCL12431 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202506-TA
ATGGCGCAGTTAGATTTGCATGACACTTTCTTAAACGATGAAGACCACCTGCATATTACAAAACAAACAGGAACATTATCCAAAAGAAAAACTGATAAAGAAGAGAAAATATCGCCAGAACATTTAGTTAAAGAGGATATGGCAGACAAAATTGCAACAATTCTTATGCACAACTATGATGGCAAATTGTATGGAGAAGATGATACATCTCAGCCAACTTCATGTACTGCACCAATTATACCTATTGAAGCTACACGTAAAAAAAAGAAAAGGAAGAGATCTCAGATATCAATATGCCTAACAAACTGTCGTTATGAGTCGATCCGTAAGGTGGCAAGTGCATTTGGTATGAGAGAGGTCTCTGAAGAGGAGGCTTGGAACTTTTACTGGACTGATATGAGCGTGTCTGTGGAACGAGCCAAGGAGATGAAGCGGTTCCAGAGGATAAATCACTTCCCCGGCATGCTTGAGATATGCAGAAAGGATTTGTTGGCAAGAAACTTAAACAGAATGCAAAAAATATATCCCAAAGAGTATAACTTCTTTCCAAAGACCTGGTGCCTGCCTGCCGATTTTGGCGAGGCACTAAACTATAGTAAATCTAGAAAGAACAAGACGTTCATAATAAAGCCGGAGTGCGGGAGTCAGGGGCGAGGCATCTACCTCACGAAGTCACTTAAGGACATCAAGCCGACGGATAAATTAATTTGTCAGGTGTACTTATCAAAGCCGTACTTAGTGGACGGTTACAAGTTTGATATCAGAGTGTACACTCTCATAACGTCCTGCGACCCTCTACGGATATTCGTCTACAACGAGGGTCTTGTGAGATTCGCGACCAGCCGCTACGCGGATCCCAATGTGAACAACACTACGAACGTGTTCATGCATCTGACGAACTACGCCCTCAACAAACACAGTCGCACATATGTCTATGACTCGGAGGCTGGCAGCAAACGCAAGATATCAACTTTGAACAAGATCCTGCTGTCGCAAGGAGTGGATTTGGACCGTCTGTGGCACTCCATCGACCAAGTTATAGTGAAGACTATTATATCAGCGTGGCCCATACTGAAACACAGCTACCACGCTTGCTTCCCCTCGCATGATATGGTACATGCCTGCTTCGAAATATTGGGCTTCGACATTTTACTGGATCACAAGCTCCATCCTTTCATTCTAGAAGTGAACCATTCGCCAAGTTTCCACACAGACACACAGTTGGATCGTGAAGTGAAAGAGGGACTGCTGACGGACACACTCACCATGCTAAATATATGGCAATGTGACAAGAAGAGAGTCCTGGAGGAAGATCGCAAGAGAATACGAGACAGACTGCTGCAGACAAACAAGTTTGCGGAATACGCTCCGACAGAGGAAAAAGAGCCGGAGAAGAGTCCCTGGCAGACTCAGATACAGTGGGAAGAGACACATCTCGGTAACTTCCGACGCGTGTATCCTGTGGGCGAGCAGTACGCCAGTCTGTTCCAACAGCCCTCGGGTTCATTATACACGGGCACGGCCTCCTCACGGGCGAGGGGAGATTGCGCACGGTTACAGAGAGAGGAATTCCAGCAAACGAAAGCAAAAGCAGAAGCTCTAAAGAAACCTTCGCAACCCAAGACCAAAGAAGACGATCAGAAACCTAAAGAAAAGAAAGAGAAGGAAAAGGAGGGAGAAGGTGGTAGTATGAGCATCAGCGTTACAACTGACAAGGGTAAGGTTAAAGTTAAGGAACAAAAGAAGAAGGAAGATAAGCCGAAAGTCGCTTCCAAACCGTCCTTGATGCAACAAACAGTTTCACCTACAGAGGAAAAAGAGAAGGAGCCGAAGCCGTCTTACGTGACGTGTTCGTATGAGCCGGACGCTATCGTTGAGAGAGAAGAAAGAGAAAGGCTCAATCTATTGGCTCAGAGGGACTTCCTCATACGCAGCTATGGGATCAACCTTGTCCATAATCTTGTCTTGTAA

Protein sequence:

>DPOGS202506-PA
MAQLDLHDTFLNDEDHLHITKQTGTLSKRKTDKEEKISPEHLVKEDMADKIATILMHNYDGKLYGEDDTSQPTSCTAPIIPIEATRKKKKRKRSQISICLTNCRYESIRKVASAFGMREVSEEEAWNFYWTDMSVSVERAKEMKRFQRINHFPGMLEICRKDLLARNLNRMQKIYPKEYNFFPKTWCLPADFGEALNYSKSRKNKTFIIKPECGSQGRGIYLTKSLKDIKPTDKLICQVYLSKPYLVDGYKFDIRVYTLITSCDPLRIFVYNEGLVRFATSRYADPNVNNTTNVFMHLTNYALNKHSRTYVYDSEAGSKRKISTLNKILLSQGVDLDRLWHSIDQVIVKTIISAWPILKHSYHACFPSHDMVHACFEILGFDILLDHKLHPFILEVNHSPSFHTDTQLDREVKEGLLTDTLTMLNIWQCDKKRVLEEDRKRIRDRLLQTNKFAEYAPTEEKEPEKSPWQTQIQWEETHLGNFRRVYPVGEQYASLFQQPSGSLYTGTASSRARGDCARLQREEFQQTKAKAEALKKPSQPKTKEDDQKPKEKKEKEKEGEGGSMSISVTTDKGKVKVKEQKKKEDKPKVASKPSLMQQTVSPTEEKEKEPKPSYVTCSYEPDAIVEREERERLNLLAQRDFLIRSYGINLVHNLVL-