Monarch geneset OGS2.0

DPOGS204689
TranscriptDPOGS204689-TA1194 bp
ProteinDPOGS204689-PA397 aa
Genomic positionDPSCF300170 + 94819-99229
RNAseq coverage985x (Rank: top 13%)
Annotation
HeliconiusHMEL0146441e-12970.69% 
BombyxBGIBMGA010136-TA3e-15686.97% 
DrosophilaRga-PA4e-7459.51% 
EBI UniRef50UniRef50_E2BZ691e-13259.16%CCR4-NOT transcription complex subunit 2 n=26 Tax=Metazoa RepID=E2BZ69_HARSA
NCBI RefSeqXP_625204.11e-13458.66%PREDICTED: similar to CCR4-NOT transcription complex subunit 2 (CCR4-associated factor 2) [Apis mellifera]
NCBI nr blastpgi|3071661533e-13558.30%CCR4-NOT transcription complex subunit 2 [Camponotus floridanus]
NCBI nr blastxgi|3838529444e-13658.39%PREDICTED: regulator of gene activity-like [Megachile rotundata]
Group
Gene OntologyGO:00056345.5e-32nucleus
GO:00063555.5e-32regulation of transcription, DNA-dependent
KEGG pathwayame:5528264e-134 
 K12605 (CNOT2, NOT2)maps-> RNA degradation
InterPro domain[260-386] IPR0072825.5e-32NOT2/NOT3/NOT5
Orthology groupMCL13929 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204689-TA
ATGGCAAACTTGAATTTCCAACAAGCTCCTAGAAGCCTAGCAAGTGGCGGGGTCGGTGGACGGGTTGGTTCAGGTTTGGTTGGCGGAGTATCAGGACATGTGACACCGACATTTGGTGGCGCCCTGTCACCAGGACGGGGCTCCACTGCGATGCCAGGGGGCGCACCACCCTCAATGGCATCTAGATCTACATTATTTGGACAGCGAGCATTTGCTGACCGTAGAGTACCTATACCGACACCACTATCACAAGCTAACTCAAATTCTATGTCAAGTATGGCGAATCTCTCCCGTTTTAATGCGAACTATCATTCCGTGTTCGGCGAGGGCGGTGACACTTCAACACCGCCACTGCTTGACCTCAGCGAGTTTCCATCACTAACGGCCCGGGGTGCCGGCGACCAAGCGCCCGCGGCTGCGCCCCCACCACCCGGCTCAAAGCCCTACGTTGGTATGGTCAAACAGCCAACGTCGGAGCAGTCTGAATTCACTATGTCTTCAGAAGACTTCCCTGCCTTACCCGGCACATCCACGGGAGCTGCGCCTCCTGACAAGCCCACGGATTACACTCATACAGACAAACCTAGAAAAGGCATACAAACCTCCCCTGACGGCAAAGTAACAAATATACCAGAGACAATGATTCCCAATCAGTTTGGGATAGTGGGTCTATTGACATTTATAAGGGCTGCCGAGTCGGACCCCAGTCTAGTCTCGTTGGCACTGGGTCAAGACCTCACAGCCCTGGGCCTCAATCTCAATTCTCCAGACAATCTATACCTTACCTTCGCAGGACCTTGGGCTGATACGCCTTGCAGGCCACAGGACATGGATTACCACGTGCCTCCAGAGTACCTGATAAATGGGTCTATTAGAGAAAAGCTCGCACCCTTACGACTGAGTCGGTATAAGGAAGACTTATTATTTTATCTATTCTACTGCTTCGTTGGCGATGTACTTCAAATTGCGGCAGCGGCGGAACTCTACAACCGCGAGTGGAGGTATCACATGGAAGAGAAGGTCTGGATATCTCAAGCCCCCGGCATGCCCATGGTTGAGAAAACATCCACCTACGAACGCGGCACTTACTACTTCTTTGACGCGCACAATTGGCGCAAGGTGGCGAAAGAATTCCACTTAGACTACAGCAAGTTGGAGGGTCGGCCGCAATTGCCGCCGCATGTGCTCACGTAG

Protein sequence:

>DPOGS204689-PA
MANLNFQQAPRSLASGGVGGRVGSGLVGGVSGHVTPTFGGALSPGRGSTAMPGGAPPSMASRSTLFGQRAFADRRVPIPTPLSQANSNSMSSMANLSRFNANYHSVFGEGGDTSTPPLLDLSEFPSLTARGAGDQAPAAAPPPPGSKPYVGMVKQPTSEQSEFTMSSEDFPALPGTSTGAAPPDKPTDYTHTDKPRKGIQTSPDGKVTNIPETMIPNQFGIVGLLTFIRAAESDPSLVSLALGQDLTALGLNLNSPDNLYLTFAGPWADTPCRPQDMDYHVPPEYLINGSIREKLAPLRLSRYKEDLLFYLFYCFVGDVLQIAAAAELYNREWRYHMEEKVWISQAPGMPMVEKTSTYERGTYYFFDAHNWRKVAKEFHLDYSKLEGRPQLPPHVLT-