Monarch geneset OGS2.0

DPOGS210993
TranscriptDPOGS210993-TA1314 bp
ProteinDPOGS210993-PA437 aa
Genomic positionDPSCF300004 + 262424-267318
RNAseq coverage218x (Rank: top 45%)
Annotation
HeliconiusHMEL0250264e-11797.07% 
BombyxBGIBMGA006476-TA1e-11895.28% 
DrosophilaCstF-64-PA4e-9480.90% 
EBI UniRef50UniRef50_F4WHC47e-10085.00%Cleavage stimulation factor 64 kDa subunit n=8 Tax=Formicidae RepID=F4WHC4_ACREC
NCBI RefSeqXP_001951232.14e-10853.32%PREDICTED: similar to cleavage stimulation factor 64 kDa subunit [Acyrthosiphon pisum]
NCBI nr blastpgi|2700007446e-11859.42%hypothetical protein TcasGA2_TC004378 [Tribolium castaneum]
NCBI nr blastxgi|2700007442e-12359.65%hypothetical protein TcasGA2_TC004378 [Tribolium castaneum]
Group
Gene OntologyGO:00036761.8e-29nucleic acid binding
GO:00001664.8e-29nucleotide binding
KEGG pathway 
InterPro domain[19-92] IPR0005041.8e-29RNA recognition motif domain
[9-99] IPR0126774.8e-29Nucleotide-binding, alpha-beta plait
Orthology groupMCL11020 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210993-TA
ATGGATAAAAGCAAGGAAAAAGAAGAACAGAGTATCATGGATAAATCCATGAGATCTGTTTTTGTGGGAAATATACCTTACGAAGCAACGGAAGAAAAATTAAAAGATATATTTAGTGAAGTTGGCCCTGTGTTATCCTTTAAACTAGTTTTCGATAGAGAAACTGGTAAACCCAAAGGTTATGGTTTTTGTGAATATAAAGATCAGGAAACTGCTCTCAGTGCTATGAGAAATCTAAACGGATATGAAATTGGTGGTAGATCACTTAGGGTTGACAATGCTTGTACTGAAAAATCGAGAATGGAAATGCAGGCATTAATGCAAGGGCCTCAGGTTGAAAACCCATATGGAGATCCTGTAGATCCAGAGAAAGCTCCCGAGGCTATTAGTAAAGCTGTTGCTACTTTACCACCTGAACAAATGTTTGAACTTATGAAGCAAATGAAATTATGTATTCAGAACAATCCAACAGAAGCTAGGAACATGTTGTTGCAAAATCCTCAACTTGCGTATGCTTTGTTACAAGCACAAGTTATAATGAGGATTGTGGACCCCATTACAGCCGTTAGTATGTTGCATCCCAGTAATGCTGTTCCTCCAGTGCTACAACCTGGTGATAAGCCGCCCGTCTCCAATAACAATCCAACAGAAGCTAGGAACATGTTGCTGCAAAATCCTCAACTTGCGTATGCTTTGTTACAAGCACAAGTGATAATGAGGATTGTGGACCCCATCACAGCCGTTAGTATGTTGCATCCCAGTAATGCCGTTCCTCCAGTGCTACAACCTGGTGATAAGCCACCTGTCTCCAATGTTTATATGCCGAATCCACCACCTCCTGTTCAACAACCACCTCTATTGGCTAATCCACCTCCGACTCAGAATCAATATGTCCCTCGTCCACCACAACAAATGGATACTGATCTACGTAACTCTCGACCGCCTCTATTAGACCAAGACATGCGCAGTCTGCATCACACTGCGCCGCCGCCCGTACATCCGCCCGTTCAGGATAATTCGTTTCCCCGAGATCCTCGTCTCGCTAGTATGCCGTTCAACTCAGACCCTCGCGTGCGATCGGACCCACGCACGCAAACTAAAATGCCACCCCAAATGCCCCCTGGCATGCCGAGTATGGCACAAGCGAGGACCATACAGGGTATTCCTTCAGGTGCATCTGACCAAGAGAAGGCTGCCTTAATTATGCAAGTGTTACAGTTATCTGATGAACAAATAGCACTGTTGCCTCCAGAACAACGTGCTAGTATTCTTATGCTTAAAGAACAAATAGCAAAAAGCACACAACAACGTTAA

Protein sequence:

>DPOGS210993-PA
MDKSKEKEEQSIMDKSMRSVFVGNIPYEATEEKLKDIFSEVGPVLSFKLVFDRETGKPKGYGFCEYKDQETALSAMRNLNGYEIGGRSLRVDNACTEKSRMEMQALMQGPQVENPYGDPVDPEKAPEAISKAVATLPPEQMFELMKQMKLCIQNNPTEARNMLLQNPQLAYALLQAQVIMRIVDPITAVSMLHPSNAVPPVLQPGDKPPVSNNNPTEARNMLLQNPQLAYALLQAQVIMRIVDPITAVSMLHPSNAVPPVLQPGDKPPVSNVYMPNPPPPVQQPPLLANPPPTQNQYVPRPPQQMDTDLRNSRPPLLDQDMRSLHHTAPPPVHPPVQDNSFPRDPRLASMPFNSDPRVRSDPRTQTKMPPQMPPGMPSMAQARTIQGIPSGASDQEKAALIMQVLQLSDEQIALLPPEQRASILMLKEQIAKSTQQR-