Monarch geneset OGS2.0

DPOGS212670
TranscriptDPOGS212670-TA1887 bp
ProteinDPOGS212670-PA628 aa
Genomic positionDPSCF300198 + 135592-142236
RNAseq coverage114x (Rank: top 59%)
Annotation
HeliconiusHMEL0075187e-16651.37% 
BombyxBGIBMGA004432-TA2e-10739.71% 
DrosophilaCG34406-PA7e-2222.39% 
EBI UniRef50UniRef50_UPI00016E1CCF2e-2832.12%UPI00016E1CCF related cluster n=1 Tax=Takifugu rubripes RepID=UPI00016E1CCF
NCBI RefSeqXP_001649852.13e-2825.08%zinc finger protein [Aedes aegypti]
NCBI nr blastpgi|1232078211e-2730.08%novel KRAB box and zinc finger, C2H2 type domain containing protein [Mus musculus]
NCBI nr blastxgi|3352989339e-3728.13%PREDICTED: zinc finger protein 502 [Sus scrofa]
Group
Gene OntologyGO:00036764.5e-11nucleic acid binding
GO:00056342.5e-07nucleus
GO:00082702.5e-07zinc ion binding
KEGG pathway 
InterPro domain[535-563] IPR0130874.5e-11Zinc finger, C2H2-type/integrase, DNA-binding
[6-73] IPR0129342.5e-07Zinc finger, AD-type
Orthology groupMCL25018 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212670-TA
ATGTCGGAAATAAAGTTATGCAGAATTTGTTTACAAACAGAAACTAAGCTTTACAACGTGCACCAATACCAACTGAAAACGTATTATGATGAAGTGATAAATAATATTAACTCCGATATGGATGATCTACCCCAATACTTCTGTTTCGAGTGTGCATATTTGCTGTATAAGTTTCACAAGTTCAAAGAAAAGTGTAGTATTGGTTATAAAATACTGACAGAGATGTTGTGTCGAGGCCCAATAACCAACAACACAATTAACGATTCATATACATTTAATTCAAATATGAAACCATCATTAAAGATAGTGAATGTGTTCGAAATTAACTATACTTTCAAAGAGGAGCCGAAATATGAAGAAATTCATACAAAAGTTGAGAATGTTGATACTGAAATTGATAACAATGTATATACTGGTGAAGACAATCAGGATATAGATACAAATGCTAAATATATAGACTCACTTAATACAGAAGATGAGCTTAATATATATGAACATAAAAATCCAAATGAATCCAAATCACTACACACATATGATAATGAAAATGATAGTAAAAACGGAATACTAAGACATAAGATCTCATTGGATGAACCTTTTTGGAAGAAACATGAAATGTCAGAAGAAGAAGCACAATTCAAAGCAAGAGCGGAGAATGATAAATACAAAAGCGCACCATTCAAGTGTACAGATTGTTTTAAAGGTTTCTCAAAAGAGAATATATTAAAGAGACATAGAATCTTGAGGCATAATGAAATATATAAAATAGAATGTCCATTTTGTCACATGCGTTTTAAATTAAATTGTTTCATGCAGAAACATTTGCGGGGCCATTACACTAAGTATGAATGCAGGAGGTGTAATGTGGTGTATCCCTTGGAGGGTTCAGCATTGTTCCACGAGGAGTTCCACAGCGGTGTCATAAGAACCTGCAGACACTGCAATGAGGAGTTCCGTCACTCGTCAACATATTATTCTCACCTCCGAACACATAAGAGTGAGTTCGTGTGTTGTGTGTGCGGGTCGTCGTTCGTCAGCGAGGCCGGACTCCATCAACACAAGCGGATCAAACACTGTGACAGTGTTGAATCCCCTGACGATGAGGAGATGAACACTTTTTGCACTAAGTGTGACATCAGCTTTGAGAGCAGGCCGGCCTACGACGAGCATCTACTGCATTCCATCAAACACATAGAAGACATTGAGAATGATAATGAAGTTGTTCAAGACAAACGGAAGAAGGTTTTGGGCAAGAGAATGAAGGAAAAGATAACCGGTCAATTGTCCAAGAAGTCAAGGAATATGACAAAGTCGGAGAGGAAGAGTTGTAAGAGAAGAAGACAACCAAGGAAACCAACCACCTGTCATCAGTGCGGTAAACATTTCGACACCCAGGCGGCGTGCATGAAGCACCACGTGACGGAACATCCGCGGACGTCCTTCACGGCTCCACACCAGAGACACATCTGTGAGATATGCGGAGCGTCCCTCGCGCCGGGGAGCGTCATCGCTCATCAGAACATGCACAGCAGAGAAAAGGTCCACCCGTGTGAGACGTGCGGCAAACAGTTCTATACAACCATATCTCTCAAACGACACTCCGTGACTCACACCGGAGAGAAACCGTTCCCTTGTAGTTTATGCGACAAGAGGTTCACACAGAGCAACAGCATGAAACTCCACTACAGGACCTTCCATCTCAAACAACCTTACCCAAAAAGAAACAGAAGAAAGAAAAAGATGAATGATAGCATGGAAGAATCTCACAGTGAAGACTCCAGTGACGTGAAGACCAAGAAGAAAACAGTTCATGAACAGGGAGTCCAAGCCAGCGCTATCACTGTACAAGTTATAAGTGACACTAACAGCCTCTTCAACTTCTGTGGGTAG

Protein sequence:

>DPOGS212670-PA
MSEIKLCRICLQTETKLYNVHQYQLKTYYDEVINNINSDMDDLPQYFCFECAYLLYKFHKFKEKCSIGYKILTEMLCRGPITNNTINDSYTFNSNMKPSLKIVNVFEINYTFKEEPKYEEIHTKVENVDTEIDNNVYTGEDNQDIDTNAKYIDSLNTEDELNIYEHKNPNESKSLHTYDNENDSKNGILRHKISLDEPFWKKHEMSEEEAQFKARAENDKYKSAPFKCTDCFKGFSKENILKRHRILRHNEIYKIECPFCHMRFKLNCFMQKHLRGHYTKYECRRCNVVYPLEGSALFHEEFHSGVIRTCRHCNEEFRHSSTYYSHLRTHKSEFVCCVCGSSFVSEAGLHQHKRIKHCDSVESPDDEEMNTFCTKCDISFESRPAYDEHLLHSIKHIEDIENDNEVVQDKRKKVLGKRMKEKITGQLSKKSRNMTKSERKSCKRRRQPRKPTTCHQCGKHFDTQAACMKHHVTEHPRTSFTAPHQRHICEICGASLAPGSVIAHQNMHSREKVHPCETCGKQFYTTISLKRHSVTHTGEKPFPCSLCDKRFTQSNSMKLHYRTFHLKQPYPKRNRRKKKMNDSMEESHSEDSSDVKTKKKTVHEQGVQASAITVQVISDTNSLFNFCG-