Monarch geneset OGS2.0

DPOGS201870
TranscriptDPOGS201870-TA2958 bp
ProteinDPOGS201870-PA985 aa
Genomic positionDPSCF300191 + 129010-132406
RNAseq coverage758x (Rank: top 17%)
Annotation
HeliconiusHMEL0134340.070.95% 
BombyxBGIBMGA006044-TA0.073.07% 
Drosophilancm-PA0.075.94% 
EBI UniRef50UniRef50_Q9VJ870.075.94%Pre-mRNA-splicing factor CWC22 homolog n=17 Tax=cellular organisms RepID=CWC22_DROME
NCBI RefSeqXP_001650254.10.073.21%cell cycle control protein cwf22 [Aedes aegypti]
NCBI nr blastpgi|1571084990.073.21%cell cycle control protein cwf22 [Aedes aegypti]
NCBI nr blastxgi|1582973800.059.34%AGAP007874-PA [Anopheles gambiae str. PEST]
Group
Gene OntologyGO:00160701.1e-69RNA metabolic process
GO:00054881.4e-67binding
GO:00055151e-38protein binding
KEGG pathway 
InterPro domain[106-313] IPR0160211.1e-69MIF4-like, type 1/2/3
[116-360] IPR0160241.4e-67Armadillo-type fold
[129-312] IPR0038901e-38MIF4G-like, type 3
[415-521] IPR0038913e-27Initiation factor eIF-4 gamma, MA3
Orthology groupMCL10398 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201870-TA
ATGGATCGCAGTAAGGAAAACAAACGACGCCACGATCGCTACAATGAGAATGAAGAATATAGGGATAACAGATCTAGAAAGCGTCGATCAAGATCACGATCTCGAAATAACGATCGGTCTAAAGATTCTAGAAGGAAAAGAAATCATTCCAGGTCGAGGTCTCCCGCTGTTGGAGACAGAAAAGATAGACGCCGTAATGATGCGCCAAAAGATGTTCCAAATAAGACAGTTGTACCATCTGCCCCCAAGAAAGCTAAAGAGACTGATATGTTAAACACTCGAACAGGTGGGGCTTATATACCGCCAGCGCGTCTTAGAATGATGCAAGCTCAAATAACCGACAAATCTTCTGTGGCATATCAAAGACTGGCTTGGGAAGCTCTAAAGAAGTCTGTGCACGGACATATAAACAAAATAAACGTTGGTAATATTGGTATCATTATTAAACAACTTCTCAAAGAAAATATTGTTCGAGGCCGTGGTTTGTTGTGTAGATCAGTTATACAGGCACAAGCTGCCTCACCAACATTCACAAATGTCTACGCCGCTTTGGTAGCAGCAGTGAACTCTCGTTTCCCTAATATAGGAGAACTCTTGCTTAAAAGGTTAGTTATACAATTCAAGAGAGGTTTCAAAAGAAACGATAAATCAACCTGCATATCATCAGCATCGTTCATAGCCCACTTGGTGAACCAAAAAGTAGCACACGAAATACTTGCTCTAGAATTATTGACATTACTTGTAGAAACACCGACAGATGATTCCGTAGAAGTTGCTATTGCATTTTTAAAGGAGTGTGGACAAAAACTAACAGAAGTGTCATCTAAAGGTGTCAATGCTATATTTGAAATGTTACGAAATATTCTTCATGAAGGGAAATTAGATAAAAGAGTACAATACATGATAGAGGTGGTGTTTCAAGTGTGGAAGGATGGGTTTAAGGATCATCCGGCTGTGATAGAGGAACTGGAGCTGGTGCCCGAAGAGGAACAATTCACTCACTTGCTGATGTTAGATGATGCTACCGATGCACAGGATATCCTGAATGTATTTAAATTTGATGACAAATATGAGGAGAATGAGCAGAAATATAAGGCCTTGTGTGGTGAGATATTGGGATCGGATGCAGAGTCCGGTGAAGATGATGGTTCAGAGGAATCGGGGAGTGAGGAGTCTGACGAGGAAGATGAAAAACAGAAGGAAGTCACAATTATTGACAACACAGAAACTAACTTAGTAGCTCTAAGAAGAACTATTTATCTTACTATAAACTCTAGTTTAGATTTCGAGGAGTGCGCACACAAGCTCATGAAAATGCAATTAAAACCCGGCCAAGAGGTCGAGCTATGTCACATGTTCCTTGACTGTTGTGCTGAACAGAGGACCTATGAAAAGTTCTATGGTCTGCTCGCACAACGTTTCTGCAACATCAACCGCATCTATATCGGCCCATTTGAGGAGATCTTCAAAGATTCTTATGCCACTGCTCACAGGTTGGACACCAATCGCTTAAGGAACGTTAGCAAATTTTTCGCACATCTTCTTTTCACTGATTCAATCAGCTGGGAAGCATTGGAGTGTGTCAAGCTGAACGAAGAGGACACAACAAGTTCCAGTAGAATTTATATCAAGATACTATTCCAAGAACTAGCTGAATATATGGGTTTGAAGAAATTGAATGATCGCCTCAAGGATCCGACATTACAGCAAGCCTTCTCAGGAATATTCCCAAGGGACAACCCAAAGAACACTCGGTTCTCTATCAACTTTTTCACATCTATAGGACTGGGAGGCTTAACAGATGAACTGAGAGAACATTTGAAGCAGATGCCAAAGAATGTGCCGCCACCCATCACAGAGATAAATATTGATAGTGAATCGAGTTCAGACTCCAGCTCTAGTTCCAGCAGCTCAGACAGCAGCAGTTCCAGCTCAAGCGACTCCAGCTCTAGTGATTCAGAGGCTGGAAACAGCAAAAAAGAGAAAAAATCGAAAAAGAAGACTAAAAATAAAACATCAAAAACTCCAGAACCAGAAGAACAACCCAAGAATAAAGAATCAGATAAAACAGAAAGATGGGGACATGATGGATTTTACGACACTTACGGAAAAGACGCCAGGAGAAATGAAGTCAGTTCTGGAAGGAACGGGAGGGAAAAAGGTGAAAATACTAGAAGCAGGCGCGACTTACAAAATGGAGAAGAAAGGCACAGGAATAATGACAGGAGGAACGAACGTGCAGAAGCTGTTAGGAATAGGGGCAGAGAAGAACACAGAGACTTGGCCAAAGAAAGAGTTGGAGACCGTGACAAGAAAGATAAAGGGAATAGGGATAGAGACGCCAACAATCGTTATGATGATAATGAAAATGACGTTAGTAAGAGACGTCACGCTCGGGACTCACGGAAAGACAGAGATGACACGAGATCGAAGAGAAATGGAAAAGAAACGGAAGAACATAGACAACGAGATGATCGCGAAAAGGAAAACGGAAAGAAAGATAGGAAACGTTACGACCGTGATGATGAGAGGGAAGCTCGGAAGAAGAAAGACGAACAGGACAAAGAGAATAGAACAAGAGATAGAGACGGCGAGGCAAGCAAACGACACGACCGAGAGGATGACATAAACAAGAATAGCGAAGGAAACGAGGGTCGGCAGAAAGACAGGAGGAACAGACGCGGAGATGACGACGGTGATAGAAGGAAACGAGAAAAGGAGGCCAGAAGAGACAAGGATAGAAACACACCTGACACCACCAGCGCTACACGACGTGAGGTGGACGAATATAAAACAAAAGACGACAAAAGCACTGAAGATGTAAACTGTCTCGGCTCCCGGTACTGGGATACGTTTGATGTGCAAAGTAAAAATGATACAAGCAAAGAGGATGATAGCTTCGAAGCGAGGACTCCCGAAAAAAATGAAAAATACTACGAGAGAAGACGCGAGCGATGA

Protein sequence:

>DPOGS201870-PA
MDRSKENKRRHDRYNENEEYRDNRSRKRRSRSRSRNNDRSKDSRRKRNHSRSRSPAVGDRKDRRRNDAPKDVPNKTVVPSAPKKAKETDMLNTRTGGAYIPPARLRMMQAQITDKSSVAYQRLAWEALKKSVHGHINKINVGNIGIIIKQLLKENIVRGRGLLCRSVIQAQAASPTFTNVYAALVAAVNSRFPNIGELLLKRLVIQFKRGFKRNDKSTCISSASFIAHLVNQKVAHEILALELLTLLVETPTDDSVEVAIAFLKECGQKLTEVSSKGVNAIFEMLRNILHEGKLDKRVQYMIEVVFQVWKDGFKDHPAVIEELELVPEEEQFTHLLMLDDATDAQDILNVFKFDDKYEENEQKYKALCGEILGSDAESGEDDGSEESGSEESDEEDEKQKEVTIIDNTETNLVALRRTIYLTINSSLDFEECAHKLMKMQLKPGQEVELCHMFLDCCAEQRTYEKFYGLLAQRFCNINRIYIGPFEEIFKDSYATAHRLDTNRLRNVSKFFAHLLFTDSISWEALECVKLNEEDTTSSSRIYIKILFQELAEYMGLKKLNDRLKDPTLQQAFSGIFPRDNPKNTRFSINFFTSIGLGGLTDELREHLKQMPKNVPPPITEINIDSESSSDSSSSSSSSDSSSSSSSDSSSSDSEAGNSKKEKKSKKKTKNKTSKTPEPEEQPKNKESDKTERWGHDGFYDTYGKDARRNEVSSGRNGREKGENTRSRRDLQNGEERHRNNDRRNERAEAVRNRGREEHRDLAKERVGDRDKKDKGNRDRDANNRYDDNENDVSKRRHARDSRKDRDDTRSKRNGKETEEHRQRDDREKENGKKDRKRYDRDDEREARKKKDEQDKENRTRDRDGEASKRHDREDDINKNSEGNEGRQKDRRNRRGDDDGDRRKREKEARRDKDRNTPDTTSATRREVDEYKTKDDKSTEDVNCLGSRYWDTFDVQSKNDTSKEDDSFEARTPEKNEKYYERRRER-