Monarch geneset OGS2.0

DPOGS200865
Transcript        DPOGS200865-TA  2259 bp
Protein           DPOGS200865-PA  752 aa
Genomic position  DPSCF300071 + 424509-436551
RNAseq coverage   757x (Rank: top 17%)
Annotation
Heliconius      HMEL012635        4e-177  90.14%
Bombyx          BGIBMGA009854-TA  2e-159  84.20%
Drosophila      l(2)NC136-PA      2e-120  74.01%
EBI UniRef50    UniRef50_F6ZZH3   1e-160  44.27%  Uncharacterized protein n=2 Tax=Ciona intestinalis RepID=F6ZZH3_CIOIN
NCBI RefSeq     XP_970507.2       0.0     52.69%  PREDICTED: similar to MGC80612 protein [Tribolium castaneum]
NCBI nr blastp  gi|189235875      0.0     52.69%  PREDICTED: similar to MGC80612 protein [Tribolium castaneum]
NCBI nr blastx  gi|193645839      0.0     52.94%  PREDICTED: hypothetical protein LOC100165745 isoform 1 [Acyrthosiphon pisum]
Group
Gene Ontology    GO:0005634  1.1e-263  nucleus
                 GO:0006355  1.1e-263  regulation of transcription, DNA-dependent
KEGG pathway     tca:659082  0.0
                 K12580 (CNOT3, NOT3)  maps-> RNA degradation
InterPro domain  [1-752]    IPR012270  1.1e-263  CCR4-NOT complex, subunit 3/5
                 [3-229]    IPR007207  7.1e-90   Not CCR4-Not complex component, N-terminal
                 [618-747]  IPR007282  1.1e-42   NOT2/NOT3/NOT5
Orthology group  MCL13904  Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200865-TA
ATGGCTGCGACAAGAAAATTACAAGGTGAAATAGACAGGTGTTTAAAAAAGGTCACGGAGGGGGTGGAGACGTTTGAGGACATCTGGCAAAAGGTACACAATGCGACGAACAGTAATCAAAAAGAAAAGTATGAGGCGGATCTCAAAAAGGAGATTAAAAAGCTTCAGAGGCTACGAGATCAGATTAAGTCATGGATCGCCTCGGGCGAAATTAAGGATAAGAGTACACTTTTAGAATATAGGAAACTAATAGAAACGCAAATGGAAAGGTTCAAAGTTGTGGAACGGGAAACAAAAACGAAAGCATACTCTAAAGAAGGGCTGGGTGCGGCGCAAAAGTTGGACCCTGCCCAGAAGGAACGAGAGGAAATGTCATCATGGCTAATATCTTCAATAGATGCACTTAATTTACAGATTGATCTATTTGAGTCTGAAGTTGAGTCACTGTTAGTTGGTAAGAAGAAACGTCTGGACAAGGAGAAACAGGATCGTATGGAGGAACTCAAGCTCAAGTTGGAAAGGCACAGGTTCCACATAAAGAAGCTAGAAACCTTACTCCGAATGCTAGACAACATGTCCGTAGAAGTGGAACAGATAAAGAGAATAAAAGAAGATGTTGAGTACTACATAGTATCATCGTTAGAGCCAGGGTACGAAGAGAATGACTACATCTACGAAGACATTAATGGCCTGGACGAGATCGAGCTCAGTGGAGTGGGACTGCCCTCGTCGGCTACAACGGATAGCAATAATAGTAACGATTCACCCGGTTCACCCACCAGTATACTCTCAGGAACGAGTCCCGTGACGTCACCATCGTTAGACACACACAACCACACGACGGATTCCATAGACGTTGACAAAAAGAAAAAAGAAGATATTACAACTAAACCTATCAAGCCGCTGCCGCTCCGTGCGGTGACGTGCGTCAGTCCGGCTAACGTTAGTTCCTTGCTCAATAACTCCGCCGCATCCAATAGCAGTATAAACAATTCTGTGACGTCGGTGACTTCGCTTTCGGGGTCTTCGACGCCCAGCAAGCCCGCGCCGCCGTCCCCGCACCCCGCCCCGCCCGCGCCGCATCCCGCCCAGACCCTGCCGCCGGCGACGCACCCCGCGCCCCACACACCCGCCTACCCCGTACCCAGGGTACCCGAGGTATTGGAGAATGGTCCCGTGTCGAGCGCTGTCCTTACTCAGCTGCCGGCGCACCCCGTGCTCGTACACGCGTCTCACCCAGTGTCTCACCCCGTGTCGCACCCTACATCGCACCCCGTGTCTCATCCTGTGTCACACCCTGTGTCACACCCCGTGTCACACCCTGCGCCGGCGCCAGCGCCGGTACCTACGTCAAAGAGTTCATCTGTAACGACGTTGTCGTCGTCGACGGCGGTCGTCAACTCGTTGTCTCACAACACGTCCGGAGCCCCGTCGCCAGCGCCGCCGGCGCCCTCGGCCTCTGCCCCCATCCCCGCGACAGCGACCGCGCCCCCACCAGCGACCGCTCTCAACGGACCCACGCTGGCCGTAGCACAGGAACACACGCAGTATGTTAACAATGTGAGGGCGCTGTCTCCGCCGGCGGTGAGCGGGAACACTACCGCCAACAGCATGGACAGCGGCGTCACAGGAACCGCCTCGCTGAAGAGCATGGCCCAGGAGGCCGTGCAGAGAGCGGGGCTCGACCACCACCACACGCAGGCGACGGGTACAGTCGGCTCGCTAACAGGAGGCACGGGCGCCAGGCGAGGCACAGCACTCTCCCAGGCGCTCATACCGCCCATACTGGGAGTGGCGCCGCTGGGACCACTGCCACTTAATAATGACCACCAGGTGCAGTTCCAGATGATGGAGGCGGCGTTCTACCACATGCCGCATCCATCAGACTCGGAGCGCACCCGAGTCTACCTGCCCAGGAATATTTGTCAGACACCGTTATATTACAATCAGGTGTTACTACCCCACTCAGACTCAGTAGAGTTCTTCCAGCGGTTGTCGAC
GGAGACGCTGTTCTTCGTGTTCTACTACATGGAGGGGACCAAGGCGCAGTACCTGGCGGCAAAAGCGCTCAAGAAGCAGAGCTGGCGCTTCCACACCAAGTACATGATGTGGTTCCAGAGACACGAGGAGCCCAAGGTTATCAATGAGGAATACGAACAGGGCACATACATTTACTTCGACTACGAGAAGTGGGGCCAGCGGAAAAAAGAAGGCTTCACGTTCGAGTACAAGTACTTAGAAGACCGCGACCTGAACTGA

Protein sequence:

>DPOGS200865-PA
MAATRKLQGEIDRCLKKVTEGVETFEDIWQKVHNATNSNQKEKYEADLKKEIKKLQRLRDQIKSWIASGEIKDKSTLLEYRKLIETQMERFKVVERETKTKAYSKEGLGAAQKLDPAQKEREEMSSWLISSIDALNLQIDLFESEVESLLVGKKKRLDKEKQDRMEELKLKLERHRFHIKKLETLLRMLDNMSVEVEQIKRIKEDVEYYIVSSLEPGYEENDYIYEDINGLDEIELSGVGLPSSATTDSNNSNDSPGSPTSILSGTSPVTSPSLDTHNHTTDSIDVDKKKKEDITTKPIKPLPLRAVTCVSPANVSSLLNNSAASNSSINNSVTSVTSLSGSSTPSKPAPPSPHPAPPAPHPAQTLPPATHPAPHTPAYPVPRVPEVLENGPVSSAVLTQLPAHPVLVHASHPVSHPVSHPTSHPVSHPVSHPVSHPVSHPAPAPAPVPTSKSSSVTTLSSSTAVVNSLSHNTSGAPSPAPPAPSASAPIPATATAPPPATALNGPTLAVAQEHTQYVNNVRALSPPAVSGNTTANSMDSGVTGTASLKSMAQEAVQRAGLDHHHTQATGTVGSLTGGTGARRGTALSQALIPPILGVAPLGPLPLNNDHQVQFQMMEAAFYHMPHPSDSERTRVYLPRNICQTPLYYNQVLLPHSDSVEFFQRLSTETLFFVFYYMEGTKAQYLAAKALKKQSWRFHTKYMMWFQRHEEPKVINEEYEQGTYIYFDYEKWGQRKKEGFTFEYKYLEDRDLN-
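
The lengths reported in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes that the transcript sequence is the coding sequence including its stop codon (the nucleotide sequence above starts with ATG and ends with TGA) and that the genomic coordinates are 1-based and inclusive. The function name `coding_length_consistent` is hypothetical.

```python
def coding_length_consistent(transcript_bp: int, protein_aa: int) -> bool:
    """Return True if the transcript length equals the protein's codons
    plus one stop codon (assumes the transcript is CDS only)."""
    return transcript_bp == 3 * (protein_aa + 1)


# Values copied from the record above.
transcript_bp = 2259
protein_aa = 752

# Span of the gene on scaffold DPSCF300071, assuming 1-based inclusive
# coordinates (an assumption; the record does not state the convention).
genomic_span = 436551 - 424509 + 1

print(coding_length_consistent(transcript_bp, protein_aa))  # True
print(genomic_span)  # 12043
```

Under these assumptions the 2259 bp transcript corresponds to 752 codons plus one stop codon. The genomic span is about 12 kb, much longer than the transcript, which would be consistent with the gene containing introns.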