Monarch geneset OGS2.0

DPOGS213231
TranscriptDPOGS213231-TA1434 bp
ProteinDPOGS213231-PA477 aa
Genomic positionDPSCF300394 + 37805-48863
RNAseq coverage188x (Rank: top 48%)
Annotation
HeliconiusHMEL0169650.090.50% 
BombyxBGIBMGA002234-TA8e-17578.75% 
Drosophilagl-PA9e-8957.62% 
EBI UniRef50UniRef50_B3LWA45e-8758.33%GF18117 n=5 Tax=Drosophila RepID=B3LWA4_DROAN
NCBI RefSeqXP_001847903.11e-10850.00%conserved hypothetical protein [Culex quinquefasciatus]
NCBI nr blastpgi|1700402152e-10750.00%conserved hypothetical protein [Culex quinquefasciatus]
NCBI nr blastxgi|1700402152e-11346.13%conserved hypothetical protein [Culex quinquefasciatus]
Group
Gene OntologyGO:00036766.2e-16nucleic acid binding
GO:00082703.6e-07zinc ion binding
GO:00056223.6e-07intracellular
KEGG pathway 
InterPro domain[353-380] IPR0130876.2e-16Zinc finger, C2H2-type/integrase, DNA-binding
[362-384] IPR0070873.6e-07Zinc finger, C2H2
Orthology groupMCL18445 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213231-TA
ATGGATTGCTACGTGCCCAACAACCCTCAGTATCTCGGATGCTGTGGGTGCTGCGACCCGCTGCAGTGCGCCTGCCAGATACGACTACCGGAGGACTGCTGTCAGCAAGAATCCGAGAACTGCTGTCCGGAAAATGGCGAGGAATTGAACACTCTAGGGGAGTCCATAGCGACCTGCGATGTGGGGCTCGATGTAAACGAGCCTGGCTGGCCGGCCGAGGACATGGGCTCCTTCTCCCTGCCGCCCCTCGACCTGGATCCCCTACCATCGTTATTCCCATTCTCACCCTGCTCCGGATACAACCGGAACAATGGGGGGGAATGTCGGGAGAGGGGGGGAGGGGAGGCCGCGGATGTGCTGCTCTCTCTGAAGCATGCCGTGGTTCATGGAGACTGCGCAGGGGACGCCGTACATCCTCAGATGGTGGTGAACAGCGGCGGTGGCTATCCCTACTATGAGCACTACGGCACCGCTCCTCTCTTCCCCACCATGAGTGTCAACGTCTCCATGAACATGACCATGCACGGCTGTCCCTCAGACCAACTCTGTTCTCAGGTACAATGGAATCAAAACGCCTCGGCCCCTTCAGTGAACGTAGTGTACCCTCAAACACAGAGCGTCATACCGGTGTCCTATCCATCAGCGACGTACTCGTTCACAGCTGACTTCAGGGCACCGAATCAATCCGATCCTCTCATCACCGCGAGCTCAACCTTCAAACCATTACAACTCCAAAACACTCAGAAACCCAATCCGAACTACTTGTTCCAACAAAAGCCAAATTTTTCGAATCAAAAGAACTTGGGAACTGTGCTAAAACGATCCCCATCCAAAATATATATGCCAGAGAGTCCAAAGGAACAAATGGGGAACGGTTACGTTTTGAACCATCAAGGACAACTGCATCAGGACTTCGGATATACCACGTGCGTTAATTCTTCGGGAAAAGTGCAAGTGGGTGCTCTAAGCGCGTGTTCCGAGGACGATGAACAGAAGCCCAACCTGTGTCGCATCTGTGGCAAGACGTACGCCAGACCAAGCACGTTAAAAACTCACCTCAGGACTCACTCTGGTGAACGACCGTACAGATGTGGAGACTGTAATAAAAGCTTCTCTCAAGCTGCCAACTTAACGGCACACGTTCGAACTCACACTGGCCAGAAACCATTTAGATGTCCAATTTGTGACCGACGATTCAGCCAGAGTTCCAGTGTTACCACACATATGAGAACACATTCGGGAGAAAGGCCTTACCAATGTCGATCCTGCAAAAAGGCATTTTCTGACAGCTCTACGTTGACAAAGCATCTGCGCATACATTCTGGTGAAAAACCATACCAGTGCAAACTATGCCTATTAAGGTTTTCTCAATCCGGCAATTTGAATAGACATATGCGTGTACACGGCAACATGTCAGGCGGGATGCTTGGCTGA

Protein sequence:

>DPOGS213231-PA
MDCYVPNNPQYLGCCGCCDPLQCACQIRLPEDCCQQESENCCPENGEELNTLGESIATCDVGLDVNEPGWPAEDMGSFSLPPLDLDPLPSLFPFSPCSGYNRNNGGECRERGGGEAADVLLSLKHAVVHGDCAGDAVHPQMVVNSGGGYPYYEHYGTAPLFPTMSVNVSMNMTMHGCPSDQLCSQVQWNQNASAPSVNVVYPQTQSVIPVSYPSATYSFTADFRAPNQSDPLITASSTFKPLQLQNTQKPNPNYLFQQKPNFSNQKNLGTVLKRSPSKIYMPESPKEQMGNGYVLNHQGQLHQDFGYTTCVNSSGKVQVGALSACSEDDEQKPNLCRICGKTYARPSTLKTHLRTHSGERPYRCGDCNKSFSQAANLTAHVRTHTGQKPFRCPICDRRFSQSSSVTTHMRTHSGERPYQCRSCKKAFSDSSTLTKHLRIHSGEKPYQCKLCLLRFSQSGNLNRHMRVHGNMSGGMLG-