Monarch geneset OGS2.0

DPOGS213652
Transcript        DPOGS213652-TA   1719 bp
Protein           DPOGS213652-PA   572 aa
Genomic position  DPSCF300165 + 207765-212476
RNAseq coverage   62x (Rank: top 68%)
Annotation
Heliconius      HMEL005256              2e-141  43.37%
Bombyx          BGIBMGA004569-TA        2e-75   44.97%
Drosophila      CG5245-PA               3e-31   31.15%
EBI UniRef50    UniRef50_UPI00022F6412  2e-34   33.64%  UPI00022F6412 related cluster n=1 Tax=unknown RepID=UPI00022F6412
NCBI RefSeq     XP_002734133.1          8e-38   31.10%  PREDICTED: zinc finger protein 45-like [Saccoglossus kowalevskii]
NCBI nr blastp  gi|291228320            2e-36   31.10%  PREDICTED: zinc finger protein 45-like [Saccoglossus kowalevskii]
NCBI nr blastx  gi|291228320            1e-45   31.10%  PREDICTED: zinc finger protein 45-like [Saccoglossus kowalevskii]
Group
Gene Ontology   GO:0003676  4e-10    nucleic acid binding
                GO:0008270  1.7e-05  zinc ion binding
                GO:0005622  1.7e-05  intracellular
KEGG pathway 
InterPro domain  [526-554]  IPR013087  4e-10    Zinc finger, C2H2-type/integrase, DNA-binding
                 [411-431]  IPR022755  6.8e-08  Zinc finger, double-stranded RNA binding
Orthology group  MCL25035   Lepidoptera specific

Nucleotide sequence:

>DPOGS213652-TA
ATGGAGTCACACACTAAAAAATATCGTTACTGCCTTGCCTGTTTATGTCATGATTCTGAGAGGGAATTAGACAATTTTAAATTTCATAATGGAACTCTTAGTGCTATAATTAAGGAAACTATACGGTTGTGTTATATATGCAGGAAAATAGCATTTAACGTGGAGCGATTTGTTCACAATGTACAAAGTAACCAGTTTCAGTTTGAGAGTACAGAAATTATTACAGAGTCCACAGTTCCAGCCGGAAACAAAATTCCACCATTAGTCAATTTGTCAAAGGAGATATTGCATTCATTAGACAGTAGAGAGAGTAAACATCTAGAAGAAACAGACTCTGAGATATTTAGCAGTAGGGATGTTAAAATTAAGGTAGAAATGAAAAATGAATTAACATCACATAATTTAGAGGAAGTGACCGGTGATAGTGAATTTCAACAGTCAAATTTGAAGGAGGAAGATTTTGCCTTGCAAGATGCAATGAAAGAGGAATCAGATATTTTGGAGAATATTAATTTAAAATTATTAAGTAAGCGGTTGAAAGAGAAAAAAAATTTTGAAAATAGCAAGTCTGATATTTATAAAGATGACAAGAAACTTAAGAGTAAAGATTGCTTTGTGACGGTACTACACATAACAAAGGAACAGTTTGACCAGGAGAAACAGAACATGATGAAGGATCCCAAATATGTGAACAGTGTGTACAGATGCGTGGACTGTATAAAGGGATTCATATTCAAAGAGTCATACGAGAAACACATGCAGAAACACAGTAAGACCATGGGCGAGTTTGAATGTGACATCTGCAAACAGAGAATGTCCACGATGGAGAAGCTGCTCAGCCATCAGAAATACCATAAGATCCGCTATAAGTGCAGGGAGTGTGAGCTGGTCCGCATCAGCCGCCTCACCATCACCGACCACTACACGGCCTGCCACCTCAAGGACACCTTTCACTACAAGTGCCCGCAGTGTGACAAGACCTTCAAACGCCAGATATCGTTGAAGAAGCACATCTCGTACTCTCACCTAAACAGAGGTCGCTCCACGTGTAGCTACTGCCACAAGAGTTACGCCAACAAGGAGGTGCTCAAGGGACATCTCATACGAGCCCACCCATCCGAAGTGTCGTCTACATCAGCGCCGCAGCACGTGTGTGCTGAGTGCGGGCTGGGGTTCCGCGCGCCCTCTCAGCTCAGGAACCACATGATCAAACACTCCGACAACAGAAACTTCTACTGCGTGGAATGTGACAGGAGTTTCAAATCAGACGCAGCCCTGAAACAACACCTCAAAGTGGCGCTACCTCACGTCAACTACATGGAACTACCACTGAAGTGCACTCACTGCGACAAGAGATTCAGCATCCGGAGAGACCTGGAGCGACACGTGAATAGAGTTCACCTCAACATCAAACCTCATCAGTGTGACAAGTGTGATAAGGCCTATATAAACGGATGGTCTCTGAGGGAGCACAAGAGTTACGCTCACGACGGTCGCAAGAGGCCGCTGAAGTTCCCGTGTCCGTACTGCGACAAGATATTTGACCGTAACGCGACCTGCAAGGCCCACGTCCGCACCCACACGGGCGAGCGGCCCTACAGCTGCAGCAGGTGCCCGGCCAGGTTCAGCCAAGCGAGCGTGCTGGCGACGCACGTGAGGCTGGTGCACCTGCACCTCACCAGGGACGGCCGGCCCAAACACGCGGCCCGCGGACACTAG

Protein sequence:

>DPOGS213652-PA
MESHTKKYRYCLACLCHDSERELDNFKFHNGTLSAIIKETIRLCYICRKIAFNVERFVHNVQSNQFQFESTEIITESTVPAGNKIPPLVNLSKEILHSLDSRESKHLEETDSEIFSSRDVKIKVEMKNELTSHNLEEVTGDSEFQQSNLKEEDFALQDAMKEESDILENINLKLLSKRLKEKKNFENSKSDIYKDDKKLKSKDCFVTVLHITKEQFDQEKQNMMKDPKYVNSVYRCVDCIKGFIFKESYEKHMQKHSKTMGEFECDICKQRMSTMEKLLSHQKYHKIRYKCRECELVRISRLTITDHYTACHLKDTFHYKCPQCDKTFKRQISLKKHISYSHLNRGRSTCSYCHKSYANKEVLKGHLIRAHPSEVSSTSAPQHVCAECGLGFRAPSQLRNHMIKHSDNRNFYCVECDRSFKSDAALKQHLKVALPHVNYMELPLKCTHCDKRFSIRRDLERHVNRVHLNIKPHQCDKCDKAYINGWSLREHKSYAHDGRKRPLKFPCPYCDKIFDRNATCKAHVRTHTGERPYSCSRCPARFSQASVLATHVRLVHLHLTRDGRPKHAARGH-
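
Consistency check (sketch):

The nucleotide and protein records above can be cross-checked against each other. The following is a minimal Python sketch, not part of the original record. It assumes the transcript is a complete coding sequence translated with the standard genetic code, and that the trailing "-" in the protein sequence marks the stop codon. The function names translate and lengths_consistent are hypothetical helpers introduced here for illustration.

```python
# Standard genetic code (NCBI translation table 1), with codons enumerated
# in T, C, A, G order. Using this table for the record is an assumption.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are rendered as stop_symbol; "-" mirrors the convention
    seen at the end of the protein sequence in this record.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for pos in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        residues.append(stop_symbol if amino_acid == "*" else amino_acid)
    return "".join(residues)


def lengths_consistent(transcript_bp, protein_aa):
    """True if the transcript length equals 3 nt per residue plus one stop codon."""
    return transcript_bp == 3 * (protein_aa + 1)


# First 30 nt and last 9 nt of DPOGS213652-TA, copied from the record above.
print(translate("ATGGAGTCACACACTAAAAAATATCGTTAC"))  # MESHTKKYRY
print(translate("GGACACTAG"))                        # GH-
print(lengths_consistent(1719, 572))                 # True
```

Passing the full DPOGS213652-TA sequence to translate should reproduce DPOGS213652-PA, including the trailing "-", if the assumptions above hold.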