Monarch geneset OGS2.0

DPOGS210707
TranscriptDPOGS210707-TA1254 bp
ProteinDPOGS210707-PA417 aa
Genomic positionDPSCF300013 - 383710-388169
RNAseq coverage272x (Rank: top 40%)
Annotation
HeliconiusHMEL0088881e-16886.32% 
BombyxBGIBMGA006317-TA0.094.17% 
Drosophilatwin-PC1e-16676.10% 
EBI UniRef50UniRef50_Q16KP33e-17079.67%Carbon catabolite repressor protein n=3 Tax=Endopterygota RepID=Q16KP3_AEDAE
NCBI RefSeqXP_001605640.10.080.22%PREDICTED: similar to GA16037-PA [Nasonia vitripennis]
NCBI nr blastpgi|3800180551e-18079.95%PREDICTED: LOW QUALITY PROTEIN: CCR4-NOT transcription complex subunit 6-like [Apis florea]
NCBI nr blastxgi|3800180551e-17779.95%PREDICTED: LOW QUALITY PROTEIN: CCR4-NOT transcription complex subunit 6-like [Apis florea]
Group
KEGG pathwaynvi:1001220370.0 
 K12603 (CNOT6, CCR4)maps-> RNA degradation
InterPro domain[61-401] IPR0051351.3e-48Endonuclease/exonuclease/phosphatase
Orthology groupMCL10698 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210707-TA
ATGGATGTAGCTGTTGAGGTTTCATGTAGCCCGCGGTGTCACCATAAACTGCCAGTGATTATCGATGCGAGCATTGTTCAGCCAGCGTTTGGCATGTGGCTCGCGTGCCGCACTCATCCGGCATACAATGGACCAAGCACGTGCTGCGGTGCTATTGATCGATCACTGTCGCCAGGTATATTTACGGTGATGTGCTACAATGTACTCTGCGACAAATATGCAACAAGACAGATGTATGGCTACTGTCCTAGCTGGGCTCTCGAGTGGGACTATCGGAAGAAGGGCATCCTTGATGAAATAAGACACTACTCCGCGGACATCATAAGCTTACAGGAAGTTGAAACGGATCAGTTCTATAACTTCTTTCTACCGGAACTGAAACAAGACGGTTATGATGGCATCTTCTCTCCGAAATCCCGAGCGAAGACGATGTCCGAGTCGGAAAGAAAATACGTCGACGGTTGTGCGATATTCTTTAGATCTGCTAAATTCTCACTAGTAAAGGAACACCTTATAGAATTCAATCAGCTAGCGATGGCTAACAGCGAGGGTTCAGACAATATGTTAAATCGAGTCATGCCGAAAGATAATATAGGTTTAGCCGCTCTGCTGAAAACTAAAGAAGCTGCTTGGGAAAACGGCGTACCAACGGATTCGTCAACATTAGCACAACCGATACTAGTTTGTACGGCACACATCCACTGGGATCCGGAGTTCTGTGACGTGAAACTGATCCAGACGATGATGTTGAGCAACGAACTGAAAAGTATTATGGAGGATTCGGCAAGGACACTCCGCCTCAGCGGACAGAAAGACAACGTGCAACTGTTGCTTTGCGGTGATTTTAATTCGTTGCCGGATAGCGGTGTAGTAGAGTTTCTGTCTGCTGGGCGTGTGTCATCTGAGCACCGAGATTTCAAGGAGCTCGGGTACGCGTCATCTCTCCGTCGTATGCCGGGCTCGGAACACGAGTTCACGCACAACTTTAAATTAGCCTCCGCATACAGCGAAGATATCATGCCCTATACTAATTACACGTTCGACTTCAAGGGTATCATCGACTACATATTCTACAGCAAGCAGTCGATGACGCCGCTGGGTCTGTTGGGGCCGCTGTCTCAAGACTGGTTCAGGGAGCACAAGGTGGTGGGCTGTCCACACCCACACATACCATCAGATCATTTCCCACTGTTGGTAGAGTTAGAGATGTACCCGCCATCCGCCAGCAGCAACGGACTGATCGGGCGCAGGTAG

Protein sequence:

>DPOGS210707-PA
MDVAVEVSCSPRCHHKLPVIIDASIVQPAFGMWLACRTHPAYNGPSTCCGAIDRSLSPGIFTVMCYNVLCDKYATRQMYGYCPSWALEWDYRKKGILDEIRHYSADIISLQEVETDQFYNFFLPELKQDGYDGIFSPKSRAKTMSESERKYVDGCAIFFRSAKFSLVKEHLIEFNQLAMANSEGSDNMLNRVMPKDNIGLAALLKTKEAAWENGVPTDSSTLAQPILVCTAHIHWDPEFCDVKLIQTMMLSNELKSIMEDSARTLRLSGQKDNVQLLLCGDFNSLPDSGVVEFLSAGRVSSEHRDFKELGYASSLRRMPGSEHEFTHNFKLASAYSEDIMPYTNYTFDFKGIIDYIFYSKQSMTPLGLLGPLSQDWFREHKVVGCPHPHIPSDHFPLLVELEMYPPSASSNGLIGRR-