Monarch geneset OGS2.0

DPOGS210557
TranscriptDPOGS210557-TA2241 bp
ProteinDPOGS210557-PA746 aa
Genomic positionDPSCF300304 + 137052-143237
RNAseq coverage1151x (Rank: top 11%)
Annotation
HeliconiusHMEL0095464e-11793.95% 
BombyxBGIBMGA013450-TA2e-17176.96% 
DrosophilaSF1-PA1e-14367.59% 
EBI UniRef50UniRef50_Q9VEJ11e-14167.59%LD36095p n=24 Tax=Eumetazoa RepID=Q9VEJ1_DROME
NCBI RefSeqXP_001600318.11e-15575.14%PREDICTED: similar to zinc finger protein [Nasonia vitripennis]
NCBI nr blastpgi|3071681616e-15270.57%Splicing factor 1 [Camponotus floridanus]
NCBI nr blastxgi|1571048680.047.75%zinc finger protein [Aedes aegypti]
Group
Gene OntologyGO:00037235.7e-12RNA binding
GO:00082702.8e-05zinc ion binding
GO:00036762.8e-05nucleic acid binding
KEGG pathway 
InterPro domain[399-492] IPR0040875.7e-12K Homology
[416-487] IPR0181111.6e-07K Homology, type 1, subgroup
Orthology groupMCL14190 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210557-TA
ATGAGTTCTCGACATAGAGACAGAAGCCGATCACGGTCTCGTGATCGTGATCGTCTTAAGGACAGGGATAAGGGACGGGAACGGGACCGGGATAAAGATAGGGAGAGAGAAAGGGACCGGGACCGTGATAGGGATCGAGAACGCGACAGGGACCGCGATCGAGACCGTGACAGAGACCGAGAGAGAGATAGAGATCGCGATCGAAACAGGGACCGGGAGAGAGACCGGGATCGAGATAGAGAGCGTCATCGGTCTAAGAGGGACAAAGATCGTGATAGAAGCCGCAGCCGTGATCGCCATAAAGAAAAGAGACGCAGTCGCTCTCGAAGCCGTAGTAGGAGTCGCGGCAGAAAATCAAAAGACAGGGATGGTACAATAGCTTTACTGGATCAAATGGTGGGCACCACTACCAAGGCGACGGCTCGCCAGGTGGCCGTTCCCACCTCCATGAACCCAGCAACACAAGCCGCCATACTGGCAGCAGCAGCCGTGGCTCAGACGTTTGTGGCTCAGCGGCGGCTGGCGGCGCCCGTGCAGCCCGCGGCGGCGGCGGCGGCAGCCCTGTCTGCAGCCCTGTCCGCGGCCACCGCCATCCCGCCGCCCACCTCTGTACAGCAGAAGCTGGAGCTGCTGCAGGCGCGCACTGAGGGACGGTACCGCGACAAGCAACCTCCCGACCACCATCCGGACGACGACCACGACGACGGACAAGGTCCTCCCGGGGAGACGGCGGCCGAGCGTCGGGCCCGGCGGCGCCGCACTCGCTGGATGGGCTCCGAGCACGACAAGACCTTCATCCCGGGCCTGCCCACCGTGCTGCCCTCCACGCTCACTCGCGAGCAGGAGGAGCAATATCTACTTCAGCTGCAGATCGAGGAGGTGAGCCGCAAGCTGCGCTCGGGCGACCTCGGCATCCCGGCCAGCGTGGACGAGAGTATGTTATCGACAGAGAAGCGCGCCCCCCTCCACCCCCTCCCTAGTGTAACTATACCGGGCGCCATGCTGGCCCCGCCCCCTCCCGACCCTGTGAGGTCGCCCTCGCCCGAGCCGATCTACTCCACGGACGGCAAGAGGCTGAACACGCGCGAGTACCGCACGAGGCGGAAACTCGAGGAGGAGAGACACCGGCTCGTCACCCGCATGCATCAGATCAACCCCGAGTTCAAGCCGCCGCCCGACTACAAGCCGCCCATCGTCCGTGTGCACGACAAGGTGATGATCCCTCAGGAGGAACACCCCGACATCAACTTCGTGGGTCTGCTCATCGGCCCGCGAGGCAACACGCTCAAAGCGATGGAGAAGGAGACCGGCGCCAAGATCATAATAAGAGGAAAGGGCTCCGTGAAGGAGGGAAAAGTCGGCAGGAAGGACGGCCAGCCGCTGCCCGGGGAAGACGAGCCTCTGCACGCCTACATCACCGCCACCAACGCCGACTGCGTCAAGAAGGCCGTCGAGAAGATCAAGGAGGTGATCCGTCAGGGTGTGGAGGTGCCCGAGGGACAGAACGACCTCCGCCGCATGCAGCTGAGGGAACTGGCGCAACTCAACGGGACTCTCAGGGAGAGCGACTCGCCGCGCTGCGCCAACTGCAGCGCCGCCGACCACAAGACGTGGCTCTGTCCGGACAAGCCGAACGTGACGAACAGTATCGTGTGTTCATCGTGCGGCGGCGCGGGACACATCGCGCGCGACTGCCGCGCCAAGAGACCGGGACACGCGCCGCCCGCCCTGCATCACGACAAGGCTAAGATCGACGAGGAGTACATGTCGCTGATGGCGGAGCTGGGGGAGGCGCCGCCCGGGGTCGGCGGAGTCACCGGCCCGTCCGCCGCGGCCGCTCGACGCACGCACGGACCCTTCGCCCCCGCGCCGCCGCCGCGGGCTATCATGCCGGCTCCCGGGAACATGGGCGGCTTCCACGCGATGACTCACCCTCCTCACCCGCCGCACCCTCCGCATCCTCCGCCGCACGCTCCCTGGCTCGGCGCGGTGTCCACTGGAGCGTCCGTGAGCGCGGCTCAGCCCCCGCCGCCCGGCAGCGCGCCGCCCTTCCCTCCGCCGCCGCCGCACCAGGCTGACGGCACTCTCCCGCCGGGTTCTTCGCCGCACCTGCCGCCGCCGCCAGGTATGCTGGCCGGTGGTCCGTGGCGCGGGTTCGCTCCCCCGCCGCCCTCCCGCCGAGGAGGGGGGCGGCGTCTGTTCGCTCCGCCGCCGCCGCCCCCGCCGGTCTCCTCCGCATAA

Protein sequence:

>DPOGS210557-PA
MSSRHRDRSRSRSRDRDRLKDRDKGRERDRDKDRERERDRDRDRDRERDRDRDRDRDRDRERDRDRDRNRDRERDRDRDRERHRSKRDKDRDRSRSRDRHKEKRRSRSRSRSRSRGRKSKDRDGTIALLDQMVGTTTKATARQVAVPTSMNPATQAAILAAAAVAQTFVAQRRLAAPVQPAAAAAAALSAALSAATAIPPPTSVQQKLELLQARTEGRYRDKQPPDHHPDDDHDDGQGPPGETAAERRARRRRTRWMGSEHDKTFIPGLPTVLPSTLTREQEEQYLLQLQIEEVSRKLRSGDLGIPASVDESMLSTEKRAPLHPLPSVTIPGAMLAPPPPDPVRSPSPEPIYSTDGKRLNTREYRTRRKLEEERHRLVTRMHQINPEFKPPPDYKPPIVRVHDKVMIPQEEHPDINFVGLLIGPRGNTLKAMEKETGAKIIIRGKGSVKEGKVGRKDGQPLPGEDEPLHAYITATNADCVKKAVEKIKEVIRQGVEVPEGQNDLRRMQLRELAQLNGTLRESDSPRCANCSAADHKTWLCPDKPNVTNSIVCSSCGGAGHIARDCRAKRPGHAPPALHHDKAKIDEEYMSLMAELGEAPPGVGGVTGPSAAAARRTHGPFAPAPPPRAIMPAPGNMGGFHAMTHPPHPPHPPHPPPHAPWLGAVSTGASVSAAQPPPPGSAPPFPPPPPHQADGTLPPGSSPHLPPPPGMLAGGPWRGFAPPPPSRRGGGRRLFAPPPPPPPVSSA-