Monarch geneset OGS2.0

DPOGS202395
Transcript: DPOGS202395-TA, 1098 bp
Protein: DPOGS202395-PA, 365 aa
Genomic position: DPSCF300515 + 14855-19482
RNAseq coverage: 28x (Rank: top 76%)
Annotation
Heliconius: HMEL010734 | 6e-68 | 41.53%
Bombyx: BGIBMGA001777-TA | 7e-41 | 31.82%
Drosophila: CG5245-PA | 1e-27 | 29.64%
EBI UniRef50: UniRef50_UPI000223636B | 2e-38 | 32.95% | UPI000223636B related cluster n=1 Tax=unknown RepID=UPI000223636B
NCBI RefSeq: XP_001946669.1 | 5e-36 | 31.32% | PREDICTED: similar to Zinc finger protein 271 (Zinc finger protein 7) (HZF7) (Zinc finger protein ZNFphex133) (Epstein-Barr virus-induced zinc finger protein) (ZNF-EB) (CT-ZFP48) (Zinc finger protein dp) (ZNF-dp), partial [Acyrthosiphon pisum]
NCBI nr blastp: gi|344297999 | 2e-39 | 34.20% | PREDICTED: zinc finger protein 26-like [Loxodonta africana]
NCBI nr blastx: gi|344308252 | 2e-48 | 32.61% | PREDICTED: hypothetical protein LOC100661788 [Loxodonta africana]
Group
Gene Ontology: GO:0003676 | 4.9e-09 | nucleic acid binding
KEGG pathway:
InterPro domain: [333-360] IPR013087 | 4.9e-09 | Zinc finger, C2H2-type/integrase, DNA-binding
Orthology group: MCL23304 | Lepidoptera specific

Nucleotide sequence:

>DPOGS202395-TA
ATGTTGGAGGAGAGACAAGTTGAGGCCAAGAAGCCGAGTTATCTGAGACTTCCATACAAATGTGATCTCTGTATAACGGCTTTCGACCACGAGTTGACTTTGAAGAGTCATATAGAGTCTAGACATAATAAATCTGGTGAATACGAGTGCTGTGTTTGCAAGTCTCACCTCTCCACTAAAATATCATTTGACGAGCATTACAAGAGACATTTCAGACGATACGAGTGTATTGAATGCGGTAAGCGAACAAACAACCTGTACCCGCTCCTCAAACACTACAACGACAACCACGGCCTCATACAGCTGGAGTTCTCGTGTAAGACCTGCGACTACAAAACCGACAAATACAGGGTGTTCAGATATCATCTGGAGAAACATCGGCAGAGTAACGTGGAGTGTGACCTGTGCGGGAAGACGTTCGTCAATAACAACGGACTGAAAACTCATCTGTACACGGTTCACGGCCAGTCGAGTCGCGTGTACGGCTGTGATAAGTGCAACAAGGTGTACAAGGCTAAGTCCGGTCTGAGCGCGCACATCGCGACGCACTCCTCGCCGTCGAGTCGCGTGTACGGCTGTGATAAGTGCAACAAGGTGTACAAGGCTAAGTCCGGTCTGAGCGCGCACATCGCGACGCACTCCTCGCCGGTCTACTGCAGGGACTGCGACACGCACTTCAGGACACCGCACGGACTCAGACACCATCTCAAGACTCACTCTAGACACGTGGAGGATAATGATAAGAGGTTCGTGTGCAAAGATTGTGATCTGAAGTTCCTAACGCCGAAGTCCCTGAGGGAGCACGTGGATTGGGTTCACTTGAACGACACGAAATACGAGTGTGACTCGTGCTCTAAGGTGTTCAAGAATAAGAACAGCCTGAAGAAACATTTTCAATACGTGCACGAAAAGAAGAGACCTCCGAGGAATAAGATCTGTGATCACTGCGGCAGAGCATTCACTACGCTACAAATCCTCCGATCCCATATCCACACGCACACGGGCGAGCGGCCGCACCGCTGCGACGTCTGTGGCGCCTCGTTCGCTCACAAGGGGGCGCTTTACACACACAATAAGTTATTACATAATAAACAATAG

Protein sequence:

>DPOGS202395-PA
MLEERQVEAKKPSYLRLPYKCDLCITAFDHELTLKSHIESRHNKSGEYECCVCKSHLSTKISFDEHYKRHFRRYECIECGKRTNNLYPLLKHYNDNHGLIQLEFSCKTCDYKTDKYRVFRYHLEKHRQSNVECDLCGKTFVNNNGLKTHLYTVHGQSSRVYGCDKCNKVYKAKSGLSAHIATHSSPSSRVYGCDKCNKVYKAKSGLSAHIATHSSPVYCRDCDTHFRTPHGLRHHLKTHSRHVEDNDKRFVCKDCDLKFLTPKSLREHVDWVHLNDTKYECDSCSKVFKNKNSLKKHFQYVHEKKRPPRNKICDHCGRAFTTLQILRSHIHTHTGERPHRCDVCGASFAHKGALYTHNKLLHNKQ-
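
The transcript and protein lengths above are consistent with each other: 365 residues plus one stop codon give 366 codons, or 1098 bp. The following is a minimal sketch of how that relationship could be checked. It assumes the standard genetic code (NCBI translation table 1) and assumes that the trailing "-" in the protein record marks the stop codon. The names `CODON_TABLE` and `translate` are illustrative, not part of any database tooling. Only the first and last few codons of DPOGS202395-TA are used, to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as `stop_symbol` (assumed here to match
    the "-" that ends the protein record).
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First ten codons and last four codons of DPOGS202395-TA.
cds_start = "ATGTTGGAGGAGAGACAAGTTGAGGCCAAG"
cds_end = "AATAAACAATAG"

print(translate(cds_start))  # start of the protein
print(translate(cds_end))    # end of the protein, including the stop

# Length check: 365 aa plus one stop codon, three bases per codon.
print((365 + 1) * 3)
```

Running `translate` on the full nucleotide sequence, under the same assumptions, should reproduce the protein record, including the final "-".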