Monarch geneset OGS2.0

DPOGS214448
TranscriptDPOGS214448-TA1239 bp
ProteinDPOGS214448-PA412 aa
Genomic positionDPSCF300441 - 58917-60522
RNAseq coverage122x (Rank: top 57%)
Annotation
HeliconiusHMEL0045465e-3430.56% 
BombyxBGIBMGA004382-TA3e-5536.05% 
DrosophilaCG5245-PA1e-2929.41% 
EBI UniRef50UniRef50_UPI000223636D2e-4033.99%UPI000223636D related cluster n=1 Tax=unknown RepID=UPI000223636D
NCBI RefSeqXP_001944705.16e-4135.62%PREDICTED: similar to Zinc finger protein 271 (Zinc finger protein 7) (HZF7) (Zinc finger protein ZNFphex133) (Epstein-Barr virus-induced zinc finger protein) (ZNF-EB) (CT-ZFP48) (Zinc finger protein dp) (ZNF-dp) [Acyrthosiphon pisum]
NCBI nr blastpgi|3442993996e-4033.99%PREDICTED: zinc finger protein 420-like [Loxodonta africana]
NCBI nr blastxgi|2607945791e-4732.00%hypothetical protein BRAFLDRAFT_119633 [Branchiostoma floridae]
Group
Gene OntologyGO:00036768.1e-08nucleic acid binding
KEGG pathway 
InterPro domain[339-365] IPR0130878.1e-08Zinc finger, C2H2-type/integrase, DNA-binding
Orthology group 
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214448-TA
ATGGAAATGTTAGGAGAAACTGAAAATGTAACTGGAAGTGATGACCAAAAAACAGTAGAAAATGCTGATATCAAACATTGTTTGGATGAAAAAACTGAGCTTCGGTTAATAAACGAAGAACTGACGGAATGTAAGATGGAAATGGATGAACAAAAGTTGGACGTTAGTGGAGGAGAAGTTTCAACATTCCGGAAGTACAGTTTACGTGAAGTCAAGCCAGTGATCAAACCTTTTCAACACTACATCAAAATGAGGATGAGGAACCACCAAATCAAGAGTGAGTTTGAAGAGCAGTACATGTGTGAAGTGTGCTACAAAATATCCACTTCACTATCATCTTACCGCACTCATATGAAGCAGCATTCTGTTATAAAAGAGTATTTGTGTTCACAATGTGATGATGTCTTCAAAACTAACCGACAGTTGTGTGACCACAAAGTCCAATCTCACAAGGACGGCTGCTTCTCGTGCCAGGAATGCCAACTGGCATTCTCTGATCATTACACATACCTCCTCCATACATTTCAACACATCAGACCACCGTACATTTGCCCTGAATGCCGCGCACCTCTATCTAAATATAAATCCTTAGCTGCCCATTTGAAAATTCACTTCAATGATTTAGTCGAATGCCACATATGTCATAAGGAGGTGCGTCAGATGAGGCTAACTGAGCATATAAGAGATCACAAGGGAGGTGTTGTCTGTGTGGAGTGCGGGAAGGTGTGTAACAATAAAAAAAACTATGATTATCATTTAATTTCTCACACCGGAGTCAAGCCCTACCAATGTCTGTATTGCGGCAAGGGATTCTCTACTGCCAACCAAATGAAGACCCATACAGCAGTCCACACGAATATAAGACGTTATAAATGCGATATCTGTCACCAAGAGTTCAAACAGCACACAGATTTGACTCGCCACAAACAGGGACACGATGGATTGAGGTTTATCTGTGATTACTGCAGGAAGAAGTTCAAACAGAAGTTGTATTTGGCCTACCACATACGTAACCACACAAACTACAGGCCCTACAAGTGTCCTAACTGTCAGTCATCATTTACGACTCTCGCTAGTCTCAGATCACACATACAATATACTCATGTGTCCAAAAGAAAGTTCTTCTGCCTCGTATGTGATTACGGTTTCCGAGTGCATTGTCAATACTTGAGACATTTGAGATCCAAACGACATTTACAGAATGCATTCAGTTCAGAGGAAAATAATCAGACAAACTAA

Protein sequence:

>DPOGS214448-PA
MEMLGETENVTGSDDQKTVENADIKHCLDEKTELRLINEELTECKMEMDEQKLDVSGGEVSTFRKYSLREVKPVIKPFQHYIKMRMRNHQIKSEFEEQYMCEVCYKISTSLSSYRTHMKQHSVIKEYLCSQCDDVFKTNRQLCDHKVQSHKDGCFSCQECQLAFSDHYTYLLHTFQHIRPPYICPECRAPLSKYKSLAAHLKIHFNDLVECHICHKEVRQMRLTEHIRDHKGGVVCVECGKVCNNKKNYDYHLISHTGVKPYQCLYCGKGFSTANQMKTHTAVHTNIRRYKCDICHQEFKQHTDLTRHKQGHDGLRFICDYCRKKFKQKLYLAYHIRNHTNYRPYKCPNCQSSFTTLASLRSHIQYTHVSKRKFFCLVCDYGFRVHCQYLRHLRSKRHLQNAFSSEENNQTN-