Monarch geneset OGS2.0

DPOGS211365
Transcript         DPOGS211365-TA   1611 bp
Protein            DPOGS211365-PA   536 aa
Genomic position   DPSCF300173 + 736882-739877
RNAseq coverage    371x (Rank: top 32%)
Annotation
Heliconius       HMEL002788         2e-172   58.60%
Bombyx           BGIBMGA008459-TA   0.0      88.58%
Drosophila       crn-PA             3e-156   66.91%
EBI UniRef50     UniRef50_P17886    4e-154   66.91%   Protein crooked neck n=76 Tax=Eukaryota RepID=CRN_DROME
NCBI RefSeq      XP_624146.2        2e-170   71.15%   PREDICTED: similar to crooked neck CG3193-PA [Apis mellifera]
NCBI nr blastp   gi|350423647       3e-170   71.39%   PREDICTED: protein crooked neck-like [Bombus impatiens]
NCBI nr blastx   gi|350423647       9e-173   71.39%   PREDICTED: protein crooked neck-like [Bombus impatiens]
Group
Gene Ontology     GO:0006396   1.5e-18   RNA processing
                  GO:0005622   1.5e-18   intracellular
                  GO:0005488   9.4e-15   binding
KEGG pathway      ame:551756   5e-170
                  K12869 (CRN, CRNKL1, CLF1, SYF3)   maps-> Spliceosome
InterPro domain   [185-215]   IPR003107   1.5e-18   RNA-processing protein, HAT helix
                  [70-232]    IPR011990   9.4e-15   Tetratricopeptide-like helical
Orthology group   MCL11484   Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211365-TA
ATGCCGAAGGTAGCAAAAGTAAAAAACAAAGCTCCTGCAGAAATACAGATCACTGCCGAACAGCTCCTACGAGAAGCCAAAGAACGTGATTTAGAAATATTACCACCTCCACCAAAACAAAAGATATCTGATCCAGAGGAATTAAGAGAATATCAACATCGTAAAAGAAAAGCGTTTGAGGATAACATCAGAAAGAACAGACTTGTTATTGGTAATTGGCTCAAATATGCACAATGGGAGGAGTCACAGAAACAAGTACAAAGAGCTCGGTCTATCTATGAAAGGGCACTGGATGTGGATCACCGAAATGTCACCCTTTGGTTGAAGTACACTGAAATGGAAATGCGTAACAGACAAGTGAACCATGCTCGTAACCTGTGGGACAGAGCAGTCACCATTTTGCCGAGGGTTTCACAGTTTTGGTATAAATATACGTATATGGAAGAAATGTTAGAAAATGTAGCTGGAGCAAGACAGGTTTTTGAACGATGGATGGAATGGCAACCGGATGAGCAAGCCTGGCAAACTTATATTAATTTTGAATTAAGATACAAAGAACTGGATAGGGCCAGACAAATATATGAAAGATTTGTAATGGTCCATCCAGATGTTAAACATTGGATCAAATATGCAAAATTTGAGGAAAACCATGGTTTCATAAATAGTGCAAGGAAAATTTTTGAAAGAGCTGTTGAATTTTTTGGTGATGAAGAATTAGATGAAAGACTTTTTATAGCTTTTGCTAAATTTGAAGAGAATCAGAAGGAACATGACAGGGCAAGGGTAATTTATAAATATGCATTGGACCATATTCCTAAAGACAGAAACAAGGAGCTGTATAAAGCTTACACAATACATGAAAAAAAGTATGGTGATAGATCTGGTATTGAAGATGTAATTAAATTTCTAGAATACGGACCAGAGAACTGTGTAACTTGGATAAAATTTGCAGAACTTGAAACTCTGTTAGGGGATATTGATAGAGCAAGGGCAATATATGAGATAGCAGTCGGACAGCCCAGATTAGATATGCCTGAGTTGTTATGGAAGAGCTACATAGATTTTGAAGTTGCCCAAAGTGAAACCGACAAAGCCAGGCAGCTGTACGAAAGATTATTAGAAAGAACGGTCCATGTTAAGGTTTGGTTATCATACGCAAAATTCGAGTTGAATGCTGAAAATCCCGATAACATCAACACAGAATTAGCGCGCAGAGTCTATGAACGCGCTAATGAAAGTCTGAAAAGTGCGGGGGAGAAAGAATCTAGAGTATTGCTTTTAGAAGCTTGGAAGGAGTTTGAAACCGAAATTGATGACAAGGAAAAACTCGAGAAGGTTCTAGCGAAGATGCCAAGGAGGGTTAAAAAGAGACAGAAGATTATAAGTGAAGCTGGCATCGAAGAGGGGTGGGAAGAAGTATTTGATTACATATTCCCAGAAGACGAAATGGTTAGACCGAATCTTAAATTGCTTGCTGCTGCCAAAAATTGGCGTAAAAAACAAGTTGTACCAGATAATTCAGAAACTGAAAGTAATGATGAGCAAGACAAGAATTCCGGAGACACACAAGAAGAGGATCCAAGCAAAGAAAGTGCTAATGAAGAATCATGA

Protein sequence:

>DPOGS211365-PA
MPKVAKVKNKAPAEIQITAEQLLREAKERDLEILPPPPKQKISDPEELREYQHRKRKAFEDNIRKNRLVIGNWLKYAQWEESQKQVQRARSIYERALDVDHRNVTLWLKYTEMEMRNRQVNHARNLWDRAVTILPRVSQFWYKYTYMEEMLENVAGARQVFERWMEWQPDEQAWQTYINFELRYKELDRARQIYERFVMVHPDVKHWIKYAKFEENHGFINSARKIFERAVEFFGDEELDERLFIAFAKFEENQKEHDRARVIYKYALDHIPKDRNKELYKAYTIHEKKYGDRSGIEDVIKFLEYGPENCVTWIKFAELETLLGDIDRARAIYEIAVGQPRLDMPELLWKSYIDFEVAQSETDKARQLYERLLERTVHVKVWLSYAKFELNAENPDNINTELARRVYERANESLKSAGEKESRVLLLEAWKEFETEIDDKEKLEKVLAKMPRRVKKRQKIISEAGIEEGWEEVFDYIFPEDEMVRPNLKLLAAAKNWRKKQVVPDNSETESNDEQDKNSGDTQEEDPSKESANEES-