Monarch geneset OGS2.0

DPOGS205192
TranscriptDPOGS205192-TA1116 bp
ProteinDPOGS205192-PA371 aa
Genomic positionDPSCF300265 - 389392-390883
RNAseq coverage351x (Rank: top 33%)
Annotation
HeliconiusHMEL0134497e-14066.01% 
BombyxBGIBMGA008783-TA7e-8342.55% 
DrosophilaCG31224-PA1e-1735.50% 
EBI UniRef50UniRef50_UPI000223652B1e-1832.46%UPI000223652B related cluster n=3 Tax=unknown RepID=UPI000223652B
NCBI RefSeqXP_001120284.13e-1832.14%PREDICTED: similar to zinc finger protein 91 [Apis mellifera]
NCBI nr blastpgi|3266780922e-2029.97%PREDICTED: zinc finger protein 160-like [Danio rerio]
NCBI nr blastxgi|1486737773e-2529.79%RIKEN cDNA A830058L05, isoform CRA_a [Mus musculus]
Group
Gene OntologyGO:00036764.3e-06nucleic acid binding
GO:00055157e-06protein binding
GO:00036771.9e-05DNA binding
KEGG pathway 
InterPro domain[304-341] IPR0130874.3e-06Zinc finger, C2H2-type/integrase, DNA-binding
[1-36] IPR0090577e-06Homeodomain-like
Orthology groupMCL19095 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205192-TA
ATGAGGCAAGCTATTAAAGCTGTTAGGAGCGGGGAAATGACTTGCACCATAGCAGCCATCACTTATGGCGTGCCGAAGAAGACGTTGACAGCCAAAGCCAATGACAAAGACAGAGACGAAGACGACAGCAAGGACTCTACCAGCGAGAAACACTACAAGCTGATAGAGGAAATCAAGAATATATTGAAATTCACCAACGCCATACCGTTCAAGACGAAGACTACCCGCTACTACTGCGCGTACTGTTCCACGGATGGGCCGATATTCGAGGACGGGGACTGTCTCCGGCTGCACACGAGACTGGAGCACGACAAAGAGAGGTTGCGGGGCGTGGAGAAGTTCATGAGGCCGCAGTGGATGAACGAAATCGTCAAACTGGACATAGAGGGCCTGCATTGTACCGCATGCAGGGTGAGGCTGCCCGACTGGAACAGGATGTTCGTACACTTCGCCAACGCGCATAACATAGAGTTCGATCAAGCGTACACCAGGGTGATCCCGTACGCGCTGACCTCACAGCTGAGATGCGTGCTGTGCAAAGAGGGCTTCCACAACTACGGCCACGTGGACAGCCACATGAACCATCACTACAGCAACTATATATGCCACCAGTGTGGCGATACCTTTTTGGCTGCGAGCCGTTTCGATAAGCACCTCAAGGTGCACAAGATAGGCAACTATCCGTGCGAGTTGTGCGACAAAGTCTTCACGCTAGAAAAGTATAGGACGAAGCATAAAAACCTGGTCCACGACCAGAAGAGCATGGTGAAGTGTCTGTATTGCAGCGAGAAGTTTGTCGGCCTGTTCCAGAGGCACTTGCACGTGACCGAACAGCATAAAGACAAGGTTAAGGTGTACACGTGCGAGTTGTGCGGCAACTGCTACACCTGGAAGACTTACTTCCTGGCGCACATGAGGAGACGGCACGGGACTGATAAGAAGCACCAGTGCAAGCATTGCGACAAATCGTTCTTGATGAAATACGAGCTGAGAAACCACATGATAAGGCATACGGATGAGAAGTTCCTTTGCGGTCTCTGCGGCAAGAGCTTCAAGAGAATGCTCACCTTGAAGAAGCACTGTCTAGCTCATGAAGAGTTCAGCGCTGAAGATTGA

Protein sequence:

>DPOGS205192-PA
MRQAIKAVRSGEMTCTIAAITYGVPKKTLTAKANDKDRDEDDSKDSTSEKHYKLIEEIKNILKFTNAIPFKTKTTRYYCAYCSTDGPIFEDGDCLRLHTRLEHDKERLRGVEKFMRPQWMNEIVKLDIEGLHCTACRVRLPDWNRMFVHFANAHNIEFDQAYTRVIPYALTSQLRCVLCKEGFHNYGHVDSHMNHHYSNYICHQCGDTFLAASRFDKHLKVHKIGNYPCELCDKVFTLEKYRTKHKNLVHDQKSMVKCLYCSEKFVGLFQRHLHVTEQHKDKVKVYTCELCGNCYTWKTYFLAHMRRRHGTDKKHQCKHCDKSFLMKYELRNHMIRHTDEKFLCGLCGKSFKRMLTLKKHCLAHEEFSAED-