Monarch geneset OGS2.0

DPOGS211174
TranscriptDPOGS211174-TA1407 bp
ProteinDPOGS211174-PA468 aa
Genomic positionDPSCF300007 + 352010-353952
RNAseq coverage258x (Rank: top 41%)
Annotation
HeliconiusHMEL0172344e-10155.88% 
BombyxBGIBMGA003161-TA2e-6863.46% 
DrosophilaCwc25-PA4e-2937.95% 
EBI UniRef50UniRef50_UPI00015B60E41e-3847.70%UPI00015B60E4 related cluster n=1 Tax=unknown RepID=UPI00015B60E4
NCBI RefSeqXP_001601556.12e-3947.70%PREDICTED: similar to ENSANGP00000012399 [Nasonia vitripennis]
NCBI nr blastpgi|3071983912e-3831.36%Coiled-coil domain-containing protein 49 [Harpegnathos saltator]
NCBI nr blastxgi|1565537977e-5932.92%PREDICTED: hypothetical protein LOC100114704 [Nasonia vitripennis]
Group
KEGG pathway 
InterPro domain[67-158] IPR0222095.9e-18Pre-mRNA splicing factor
[12-47] IPR0193398.2e-15CBF1-interacting co-repressor CIR, N-terminal
Orthology groupMCL34871 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211174-TA
ATGGGAGGCGGTGATTTGAATACGAAAAAAGCTTGGCATCCCAGCAACTTAAAAAATCAGGAACGTGTCTGGAAAGCGGAGCAAGCTGCTGCCGCCGAAAAGAAACGAATTGAGGAATTAGAGCGGGAACGAGCACAGGAGAGAGATCGAGAGGAGTTGAATGCCTTATCCAGACAAAATGTCAACACTAATACAACTCAAGAAAATAGGCTCCATTGGATGTACGATAAACCAGATAAACAGGTGCAACAAGAAGATTATTTGTTGGGAAAGGCGATTGATAAAAACTATGACCAAGGTGATAAGCAGGATCAAAATGAGATCCCAGCAGTATCCCGCCGAGTTGTTGGATCTAGCATGATGACTGCTGGAGGCGATGTTCAAGTGGATTTGGCAAGGAAACTCCGCGAGGATCCATTGTTGCTGGTGAAGGAAAGGGAGAGAGCTGCGAGGGCGGCATTGCTTAACAATCCAATCCAGAGACGCAAACTGACTGAGCTCTTAAGAAAAGAACAAGAAAAGAAGATATTAAAAAAAAAGTCCAAGAAAAGTAACATTGATGAACTGCTTGCTTCCAAATTAAGTGCCCTAGCTGGTGAAAAAAGCATGAACCTAGCAAAATTACTTGACTCAGACAGTTCATCGTCAGATAGTTCGTCATCATCATCACGGAAGAAAAAAACTAAAAAAAAGAAGAAAAAGATGTCAAAACATAAATCCAAAAAGCACCAAGGAAGTGACAGTGACGAAAAACAAGAAAAAAGTAAAAAGTCCAAAGATAAGAAGAAACATAAGGATAGAAAAGATAAAAACGATGACAGTGATGCTGGAACCATTTCCAAACGAGAAAAATATCACAATGACTTAAAGTTCGATGATAAAAAATATCCTTCATCAATGAAACGTAAAAGTCATGTCGATAGTGATGGTGAACCTCAAAGAAAAAGTAGGAAATCTCTTAAAGAAGTTTCACCATATCAAAAAAGAAATGAAAGCTATAGCCAGGATCGGTATAGACGCTCCAGCGGTGATAGAAACAGATCGAGGAGGGAAACCGAAAATGATGACAGGAGAAACTCTGATCGAAGGGGGCCATCAAGGTCCAGGTCAAGATATGATCAAAAAACTCCCCGGGATAATCAGAGATCTGAGTTAAGTCAGGACGAGAAGGCAGCCAAATTAGCGGCTATGGTTCAAGCTGGTGCGGAGAGGGAAATACAGAGGGGGAGAAGGGTCGCTGAGCAATTAGCAGAGAAAGCTGCCGAAGACACTACGACCTTGCCAAGGTCATCCCACAAGAACCAAGCGCGGACACTACCGGACTCCCTCGAAAGCCGCATCCATTCAAATCGTCACAACATTCAACGCGACAAACGACACATGAACGAACATTTTGCTAGGAGATGA

Protein sequence:

>DPOGS211174-PA
MGGGDLNTKKAWHPSNLKNQERVWKAEQAAAAEKKRIEELERERAQERDREELNALSRQNVNTNTTQENRLHWMYDKPDKQVQQEDYLLGKAIDKNYDQGDKQDQNEIPAVSRRVVGSSMMTAGGDVQVDLARKLREDPLLLVKERERAARAALLNNPIQRRKLTELLRKEQEKKILKKKSKKSNIDELLASKLSALAGEKSMNLAKLLDSDSSSSDSSSSSSRKKKTKKKKKKMSKHKSKKHQGSDSDEKQEKSKKSKDKKKHKDRKDKNDDSDAGTISKREKYHNDLKFDDKKYPSSMKRKSHVDSDGEPQRKSRKSLKEVSPYQKRNESYSQDRYRRSSGDRNRSRRETENDDRRNSDRRGPSRSRSRYDQKTPRDNQRSELSQDEKAAKLAAMVQAGAEREIQRGRRVAEQLAEKAAEDTTTLPRSSHKNQARTLPDSLESRIHSNRHNIQRDKRHMNEHFARR-