Monarch geneset OGS2.0

DPOGS214559
TranscriptDPOGS214559-TA1665 bp
ProteinDPOGS214559-PA554 aa
Genomic positionDPSCF300266 + 59468-61588
RNAseq coverage117x (Rank: top 58%)
Annotation
HeliconiusHMEL0031610.065.82% 
BombyxBGIBMGA003283-TA0.059.12% 
Drosophilacrol-PE2e-3131.01% 
EBI UniRef50UniRef50_D6WM195e-6732.82%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WM19_TRICA
NCBI RefSeqXP_001120631.12e-3729.41%PREDICTED: similar to zinc finger protein 235 [Apis mellifera]
NCBI nr blastpgi|2700077592e-6632.82%hypothetical protein TcasGA2_TC014456 [Tribolium castaneum]
NCBI nr blastxgi|2700077593e-7033.07%hypothetical protein TcasGA2_TC014456 [Tribolium castaneum]
Group
Gene OntologyGO:00056349.8e-12nucleus
GO:00082709.8e-12zinc ion binding
GO:00036761.7e-09nucleic acid binding
GO:00056226.7e-06intracellular
KEGG pathway 
InterPro domain[8-84] IPR0129349.8e-12Zinc finger, AD-type
[409-435] IPR0130871.7e-09Zinc finger, C2H2-type/integrase, DNA-binding
[473-496] IPR0070876.7e-06Zinc finger, C2H2
Orthology groupMCL19023 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214559-TA
ATGGAAACTTCTGAAACTATGTATTGCCGCTTATGTGCTGAGCAAAAGAACGCTGCTGATATAATTAACCGTGATAACATCGCACAGAATTTTGGAGAAATAGTTGCTAAATTGAACTTCTTAAACTCACTTTTCATAGATTTATTTAAAAAGGAGCAAATGTTGCCGAAAACGGTTTGCATAAGTTGCTACGAGGGATTGAATAAGGCTTATGCTTTTGTGAATAAAGTACAAAGAGCTCAAGAGCTGCTAGTTAATATTTTCCAAGAAAACATCGAAAAGTATGCTTCTTCAGACGACGATAAGGCTGGGCTCGATGAATTCCTAAACCCGGCTGAGCTGAGCGGTAATGAAGACGTAAAACAGGAATCCATAGAAATAGAAATACCGATACCTGTACCAGCTAAAGTGGAAGTCAAGAAAGAACCGAAAGATGAGATGGATAATGAGCAAAGTCAGGAGAGTACGTTTGACAACACCCTGAATGTCCAAGACCTGTTAAACGCGGCGATGTGCAATGTGCCATATTCCGAAGTAACATTATATGCTAAGGAAGTCAAGGATGCAAGCAAAAGGTCAATAACATCATGGAAAGAGTACCCCTGGCTTTGTGCGCATTGCGATATAGAGTTTCTGGATATAGTGACACTCCGGATGCACTCGAAAATGACACACGGGAAGTGTAATGCGTTTGTATGTATAGATTGTGATAATTGCGGTACCGGCGACTTTGAATATTTTATTAATCATGTTAAGTCGCACAGGAAGAGGCTCAGGAAAAGATGTTACTATTGCGATGTAGTTGTGAACGAAGAACTACTTGTAGAACACGTCCAGGAACACATACAGAAATATCAAGTACCATGCAACATGTGCGGAGAGATATTGGACAACAGTGAAACTCTGTTTAAACATCTAGAAGAATACAACTCAACGAAACCCAAACGGAAACCCAGAAGAAAGAGAGGCACACCGCTGACCGTTGAAGATCTGACATGCGAATTATGTAAGAAAGTGTACAAAAATCCTAACAGCCTGCGGGATCATATGAAAATACACAACGGTGATAGAAAGAGGAATTACACTTGCGATCGGTGTGGGAAAATGTTCTATAACAAAGGGACCTTGTCATCCCACATTTTGGCACACGACCAAATCCGGCCTCATATATGTAGGATTTGCAACAAATCCTTCCTATTCCCAAACATGTTGCGACGGCACGTCGAGATGCACTCCGGTGTAAAGCCGTTCTCATGCGAGCAGTGCGGTCGTTGCTTCAGACTGCAGTACCAACTTAATGCGCATAAAATTATTCACACGGATTCTATGCCCCACGTCTGCCAATATTGCAATAAAGCGTTCAGATTTAAACAGATACTGAAAAACCACGAGAGACAGCACACTGGGGCCAAGCCGTACGCATGTCAGAACTGTGGCATGGAATTCACTAACTGGTCCAATTATAATAAGCATATGAAAAGGCGGCACGGCTTGGACACGTCTAAAAAGAAGATCACGCCCGACGGAGTGTTCCCTATAAATCCACAGACAGGACAAATTGTTCAACTAAACGACGCCAGTACAGAAGAATGGAAGACAAAGATCATGGTGCCGGGCAAAAGAGGCAAAAAGAAGATTATAAAGAAGGTTGAAGATGTAGCATAG

Protein sequence:

>DPOGS214559-PA
METSETMYCRLCAEQKNAADIINRDNIAQNFGEIVAKLNFLNSLFIDLFKKEQMLPKTVCISCYEGLNKAYAFVNKVQRAQELLVNIFQENIEKYASSDDDKAGLDEFLNPAELSGNEDVKQESIEIEIPIPVPAKVEVKKEPKDEMDNEQSQESTFDNTLNVQDLLNAAMCNVPYSEVTLYAKEVKDASKRSITSWKEYPWLCAHCDIEFLDIVTLRMHSKMTHGKCNAFVCIDCDNCGTGDFEYFINHVKSHRKRLRKRCYYCDVVVNEELLVEHVQEHIQKYQVPCNMCGEILDNSETLFKHLEEYNSTKPKRKPRRKRGTPLTVEDLTCELCKKVYKNPNSLRDHMKIHNGDRKRNYTCDRCGKMFYNKGTLSSHILAHDQIRPHICRICNKSFLFPNMLRRHVEMHSGVKPFSCEQCGRCFRLQYQLNAHKIIHTDSMPHVCQYCNKAFRFKQILKNHERQHTGAKPYACQNCGMEFTNWSNYNKHMKRRHGLDTSKKKITPDGVFPINPQTGQIVQLNDASTEEWKTKIMVPGKRGKKKIIKKVEDVA-