Monarch geneset OGS2.0

DPOGS206420
TranscriptDPOGS206420-TA927 bp
ProteinDPOGS206420-PA308 aa
Genomic positionDPSCF300181 + 47056-49962
RNAseq coverage300x (Rank: top 37%)
Annotation
HeliconiusHMEL0029062e-7890.07% 
BombyxBGIBMGA013786-TA6e-12075.17% 
DrosophilaMat1-PA2e-11060.70% 
EBI UniRef50UniRef50_E2BZE42e-11163.99%CDK-activating kinase assembly factor MAT1 n=11 Tax=Bilateria RepID=E2BZE4_HARSA
NCBI RefSeqXP_396068.15e-11965.48%PREDICTED: similar to Mat1 CG7614-PA [Apis mellifera]
NCBI nr blastpgi|481062209e-11865.48%PREDICTED: CDK-activating kinase assembly factor MAT1-like isoform 2 [Apis mellifera]
NCBI nr blastxgi|3504179609e-11767.31%PREDICTED: CDK-activating kinase assembly factor MAT1-like [Bombus impatiens]
Group
Gene OntologyGO:00056346.4e-165nucleus
GO:00070496.4e-165cell cycle
KEGG pathwayame:4126131e-118 
 K10842 (MNAT1)maps-> Nucleotide excision repair
InterPro domain[1-306] IPR0045756.4e-165Cdk-activating kinase assembly factor (MAT1)
[1-309] IPR0163903.9e-147Cdk-activating kinase assembly factor (MAT1), metazoa
[53-251] IPR0158771.2e-55Cdk-activating kinase assembly factor MAT1, centre
[2-63] IPR0130834.9e-12Zinc finger, RING/FYVE/PHD-type
Orthology groupMCL13279 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206420-TA
ATGGATGATCAAGCATGTCCCCGTTGTAAAACAACGAAATACAGAAATCCATCCCTAAAGTTGATGGTAAACATTTGTGGCCATGCTTTGTGCGAGAGCTGTGTTGATTTATTGTTTTTAAAAGGATCTGGTTCATGTCCTGATTGCAATGTTCCTTTGCGTCGTAGTAATTTTCGTGTACAGCTTTTCGAAGATTCCATGGTGGAAAAAGAAATGGATATAAGAAAACGTGTTCTCAAGGACTTTAACAAAAAAGAAGAGGATTTCTCAACACTCAGAGAATATAACGATTATTTAGAAGAAATAGAAGTAATAATATATAATTTAGTCAATAACATAGATGTGGTCGGAACAAACAAAAGGATAGAACAATATAAAAGGGATAATAAAGAACTTATTATGAAAAACAAAGCCAAAATCGGTAGGGAAGAAATAGAATTAGAGGAGATATTGGAAATTGAAAAGCAAATGGAGGAATTAAGACGTCAGGAAATAGCTAAGATGGAGGATGAGGCGAAGAAACAGAAAATAAGAGCAAAGGAAGCTTTGATTGATGAGTTAATGTTCGCCGACGGAGACGCTAAGGATATATTGAACACATTTGCACAAACTGTGGCTAATAAGCAAGAGGAAGTTGTGCCGCTGCTACCTAAAGTGACACAGTTCTCATCGGGTGTGAAATTTACTAGAGGTTCGAGTCAGGCAATACCTATAATAGAAGAAGGGCCGCTTTACAAATATGAACCGTTAGAAATACCTGATAGATGTGGACCGGATCCACCGTCGTTGGAGGAGATTATGAATAACGGGTTTCTGCATCACGTTAGAGCAGAGAACGAGACAGAGAAAGCTGGTGGTTATACATCTACTCTACCGTGTCTGAGAGCACTCCAAGATGCACTCTCCGGCCTCTACCACGCCAGCTGA

Protein sequence:

>DPOGS206420-PA
MDDQACPRCKTTKYRNPSLKLMVNICGHALCESCVDLLFLKGSGSCPDCNVPLRRSNFRVQLFEDSMVEKEMDIRKRVLKDFNKKEEDFSTLREYNDYLEEIEVIIYNLVNNIDVVGTNKRIEQYKRDNKELIMKNKAKIGREEIELEEILEIEKQMEELRRQEIAKMEDEAKKQKIRAKEALIDELMFADGDAKDILNTFAQTVANKQEEVVPLLPKVTQFSSGVKFTRGSSQAIPIIEEGPLYKYEPLEIPDRCGPDPPSLEEIMNNGFLHHVRAENETEKAGGYTSTLPCLRALQDALSGLYHAS-