Monarch geneset OGS2.0

DPOGS207385
Transcript: DPOGS207385-TA  2160 bp
Protein: DPOGS207385-PA  719 aa
Genomic position: DPSCF300267  +  56137-65190
RNAseq coverage: 846x (Rank: top 15%)
Annotation
Heliconius: HMEL012240  7e-148  70.82%
Bombyx: BGIBMGA009009-TA  6e-155  82.87%
Drosophila: crol-PE  1e-75  34.53%
EBI UniRef50: UniRef50_Q17CV9  1e-111  62.15%  Zinc finger protein n=1 Tax=Aedes aegypti RepID=Q17CV9_AEDAE
NCBI RefSeq: XP_001649111.1  2e-112  62.15%  zinc finger protein [Aedes aegypti]
NCBI nr blastp: gi|157105982  4e-111  62.15%  zinc finger protein [Aedes aegypti]
NCBI nr blastx: gi|157105982  2e-114  62.15%  zinc finger protein [Aedes aegypti]
Group
Gene Ontology: GO:0003676  2.9e-15  nucleic acid binding
KEGG pathway:
InterPro domain: [486-510]  IPR013087  2.9e-15  Zinc finger, C2H2-type/integrase, DNA-binding
Orthology group: MCL18232 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207385-TA
ATGTTCGAACAACAAATCAAGGCAGAGCCCATGAGCTTTTACACACATTCACATATAAATACTGGACCCCCAACGATAATGCGCTCGGATTCCGGCCATGGCATAATCAGTATGAATCAGCACCACCCCCAGGAGGACTCCAAGGACAGTCTTATACAACAGCAAGTACAACACCAACAAGAGCTGATGGAACAGCACCAACAGGACTTGCAGCACGACGATGATGTTGATAATTTAAGCTTCAAAGGCATGGATGATGAAGGTGTTGAATTGGACATGGACGGCAGACAATGTTCTCAGGGCATGGTCGACATGGGATCAGTTCAAACCAAAATGGAAGTCACAAACGGGGGTGGGATGCCAAGATCGAAACCACAAGCTTGTAAAGTGTGTGGTAAGGTGTTATCATCTGCCTCGTCATATTATGTTCACATGAAGCTACATTCTGGCAACAAACCTTTTCAATGCACGGTATGCGACGCAGCGTTTTGTCGCAAGCCTTATCTTGAGGTGCACATGCGCACGCACACTGGCGAGCGTCCCTTCCAGTGCGATCTGTGCCTAAAACGCTTCACACAGAAGTCCAGCCTCAACACGCACAAGCGCGTACATACCGATGAGCACATGCGAGCCTTGATGGTGAAGGATCGACCCTACCAGTGTGAGGTCTGTCTGATGCGCTTCACTCAGAGCTCCAGCCTCAACAGACACAAGAAAATACACACGGAGGAGCACAGACGAGCCCTGTTAGAAAAAGTGCGGCCGTACCAGTGCCACATCTGTTTTATGCGCTTCACTCAGAAGTCCAGCCTGGGCCGACACGGAAAGATACACACCGAGGAGCACATCCAATCGCTGATCAACAAAGTGCGCCCCTATCAATGCGACATCTGTGACAAGCGGTTCACTCAGAAGTCCAGCCTTGGCACTCATAAACGTATACACACCGTCCAGGGGAGACCGTTCCAGTGCCTGTCGTGCCCGGCCGCCTTCACCTGCAAGCAATATCTGGAGATACACACGCGCACACACACAGGCGAGCGGCCCTATCAGTGCGACATCTGCCTCAAGCGGTTCACACAGAAATCCAGTTTGAACATCCACAAGCGGACGCACTCAGGTATGTGTCGGGGAGTTCGGGAGAGGCCGGCGGGCGGGAAGCGACGTCTCCGACCCTCGCCCACTGACCGTGTGTTTACAGTTCAGGGCCGGCCGTTCCAGTGTCTCCAGTGCCCGGCCGCCTTCACCTGCAAGCAGTACCTCGAGATACACAACCGCACGCACACCGGCGAGCGCCCCTACCAGTGTGACGTCTGCCTCAAGAGATTCGCGCAAAAGTCTACACTCAACATACACAAAAGAACGCACACAGTGCAAGGGAGGCCGTATCAGTGCATGGAGTGCCCGGCGGCGTTCACATGCAAGCCGTACTTGGAAATACACATGCGCACTCACACTGGCGAGCGTCCCTTCGAGTGCGATGTCTGTTACAAACGCTTCACCCAGAAATCAACACTCAACATTCACAAGCGAATTCATACCGGCGAACGTCCATACGCATGTGATATTTGCCAAAAACGATTCGCAGTTAAAAGCTACGTAACAGCGCACCGATGGTCCCACGTGGCGGACAAACCCCTGAACTGCGAGCGATGTTCTATGACGTTCACTTCCAAGTCCCAGTTCGCCCTCCACATCCGGACCCACGCCAGTGGACCCTGCTACGAGTGTAGCGTCTGCGGCCGGACCTTCGTCAGGGACAGTTATCTCATACGTCACCACAACCGCGTACACCGTGAGAACCACAGCAACATATCAGCGAACAGCATCGGCACCATCAACAGCGTGGCCACCAACACCAACTCCAACAACGGCAACTACGACTCGCCAGGCGTCTGTGACCTCAGCTTCGTGCCGATGGTGAATCGCTACATGACGTCTCAGGGCACGCAGGTTTCCATGCAGGACACGAAAATGTCCGCCATGTCGCCGCAGTCGAT
CGCCGTCGCCGCCGCCCCCGCACACCCCCACGCCCCAGCCGCAGATGTCCATGCGACTGTCGGATTGATCCGCACTACAACAGCTGATAACTCTCGATTGTTCGTTGGACGGCCCGAGTGCAATATACGAAGGAGGCAGCCATCACGAGCAGCTACATAG

Protein sequence:

>DPOGS207385-PA
MFEQQIKAEPMSFYTHSHINTGPPTIMRSDSGHGIISMNQHHPQEDSKDSLIQQQVQHQQELMEQHQQDLQHDDDVDNLSFKGMDDEGVELDMDGRQCSQGMVDMGSVQTKMEVTNGGGMPRSKPQACKVCGKVLSSASSYYVHMKLHSGNKPFQCTVCDAAFCRKPYLEVHMRTHTGERPFQCDLCLKRFTQKSSLNTHKRVHTDEHMRALMVKDRPYQCEVCLMRFTQSSSLNRHKKIHTEEHRRALLEKVRPYQCHICFMRFTQKSSLGRHGKIHTEEHIQSLINKVRPYQCDICDKRFTQKSSLGTHKRIHTVQGRPFQCLSCPAAFTCKQYLEIHTRTHTGERPYQCDICLKRFTQKSSLNIHKRTHSGMCRGVRERPAGGKRRLRPSPTDRVFTVQGRPFQCLQCPAAFTCKQYLEIHNRTHTGERPYQCDVCLKRFAQKSTLNIHKRTHTVQGRPYQCMECPAAFTCKPYLEIHMRTHTGERPFECDVCYKRFTQKSTLNIHKRIHTGERPYACDICQKRFAVKSYVTAHRWSHVADKPLNCERCSMTFTSKSQFALHIRTHASGPCYECSVCGRTFVRDSYLIRHHNRVHRENHSNISANSIGTINSVATNTNSNNGNYDSPGVCDLSFVPMVNRYMTSQGTQVSMQDTKMSAMSPQSIAVAAAPAHPHAPAADVHATVGLIRTTTADNSRLFVGRPECNIRRRQPSRAAT-
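
The reported lengths can be cross-checked against each other: a 719 aa protein plus one stop codon should correspond to a coding transcript of 3 x 720 = 2160 bp. The following is a minimal sketch of that check, together with a small FASTA reader that joins sequence lines wrapped across several lines. The function names `read_fasta` and `cds_matches_protein` are illustrative and are not part of the geneset or any MonarchBase tooling; the check assumes the transcript is coding sequence only, with no untranslated regions.

```python
def read_fasta(text):
    """Parse FASTA text into {header: sequence}, joining wrapped sequence lines."""
    records = {}
    header = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue  # skip blank lines between header and sequence
        if line.startswith(">"):
            header = line[1:]
            records[header] = ""
        elif header is not None:
            records[header] += line
    return records


def cds_matches_protein(transcript_bp, protein_aa):
    """True if the transcript holds three bases per residue plus one stop codon.

    Assumes the transcript is coding sequence only (no UTRs).
    """
    return transcript_bp == 3 * (protein_aa + 1)


# Lengths as reported in the record above: 2160 bp transcript, 719 aa protein.
print(cds_matches_protein(2160, 719))  # True
```

To apply the same check to the sequences themselves, pass the text of the two FASTA records to `read_fasta` and compare `len()` of the nucleotide sequence with the protein length after dropping the trailing `-` stop marker.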