Monarch geneset OGS2.0

DPOGS205191
TranscriptDPOGS205191-TA1641 bp
ProteinDPOGS205191-PA546 aa
Genomic positionDPSCF300265 - 404883-409571
RNAseq coverage104x (Rank: top 60%)
Annotation
HeliconiusHMEL0134483e-13956.43% 
BombyxBGIBMGA008758-TA0.067.38% 
DrosophilaCG12299-PA2e-2530.77% 
EBI UniRef50UniRef50_D6WJ675e-2832.50%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WJ67_TRICA
NCBI RefSeqXP_001815603.11e-2832.50%PREDICTED: similar to Zinc finger protein 26 (Zfp-26) (Protein mKR3) [Tribolium castaneum]
NCBI nr blastpgi|3266667304e-2824.51%PREDICTED: zinc finger protein 729 [Danio rerio]
NCBI nr blastxgi|3266667302e-3425.55%PREDICTED: zinc finger protein 729 [Danio rerio]
Group
Gene OntologyGO:00036765.3e-09nucleic acid binding
GO:00056341.9e-06nucleus
GO:00082701.9e-06zinc ion binding
KEGG pathway 
InterPro domain[512-540] IPR0130875.3e-09Zinc finger, C2H2-type/integrase, DNA-binding
[15-91] IPR0129341.9e-06Zinc finger, AD-type
Orthology groupMCL25536 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205191-TA
ATGAACGGTATTAAAGGTAAGGGACCTGTGTTCGACCCGGGTCTCTGCAGATGTTGCGGTGCCATCAAAAAGTGCAGACTTTTAAATTACGAGTACGAAAGCTTTGGAAATAAAGAAATATATTCTGACATGATCATGGATTGTTACGGACTCGTCCTGTCTCATTTAGACGGGAAGCAGACTGAGAGGCTGGTGTGTGCGTCTTGTGTCCACCGTCTAAGGGACGCTATGGCATTCAGACAGCAAGTGCTGAAATGCGAAGAGGCATTCCTCCAGGTCAAGATATACGACGAAAAAGAGAATCACGCGGAAGAACAGAAAATCGAGGTGGAAGTCAAGCTTGAACCGGAAGAGATCAGCAACGAGACCGAGATGGCCAAAGACGCGCTAGCAGACGAGTCGCAGACTGACAACGAAATTAAACCGGATATGGCAATAAATGCACCAAACGAGGACATTCCGCTACAGGAAATAGTGACGGAGGATGAGGACAACAAATTAGATAAGATCAACGTAGAAGCTTTGCCTAAAAGATTTAAAACCATGAAGAGATTGTCAGCGGAGAGAAGCAAAATCCCAATGTCGAAACTTCAGCAACGGATGAGGGAGCGGGACCCCACGTATATAACTGAGACGAACATACTCACAATCGTTGAGTTTTCCTACGCGTGTCCATTCAAATGCCGCCACAACCACCTGCTGTGCTTCTACTGCGGTCAGAACTTCTCCGACCCGCAGCAGCTCAGGGACCACACATCGCAGTTCCACCATCCAAGGAAATTCAAAATAACGGACCACAAAAACATGCTCAAGTTGGACCTCACCCGGATAGACTGCAGGCTCTGCGGCCACAAGACGGACGACCTCGACGATTTCAAAACCCACGTGACCGATGTCCACAAAAAGAAATATTACTTCAACGTCAAAGATCTGATGCTGCCATTCAAGTTGTCCAAGGACGAGTTCAAGTGCGCGCTCTGTGACGTCATCTTCCCCTACTTCCACGCTCTCAACAAGCACATGAACGAGCACTTCAGCAACTACGTCTGCGAAACCTGCGGGCTCGGGTTTGTGGACCGCGGGAGGTTCCTGATGCATCAGCAGCGGCACGAAGAGGGCGACTTCCCGTGCGAAGTCTGCGGGAAGGTTTTCAAAGCCCAGTACAATAAAGAACTGCATATTGATCGAGTTCACAAAAAGAAAGGTCGTGTGTATTGCCCGAAATGCGACGTGAGGTTAATGAACTACCCACAAAAACTGAAACACCTGGTCGAAGTCCACGGCGAGGAGCCCCTGTCCTTCAGCTGTAACATGTGCGACAAAGTCTGCGAGACGCGAAGAAAGCTGACAATACACAGACGGAAAGAACATTTGAAAGATTACAGATACGAATGCCAGTGTTGCGGGCAGAAATTTTTCACACGTTTCGCCCTGACCAACCACATGCCGACACACACAGGCGAGAGGAACTTTAAGTGCAAGGTCTGCGAGAAGACCTACCCTCGCCTGAAAACATTGAAAGACCACCTCCGCATCCACACCAACGACAGGAGATACAGATGCCACATATGCGGACAGGCCTTCATACAGAACTGCAGCCTCAAAGGGCACATGAAGAGCCAGCATCCCGAATATAGCTAG

Protein sequence:

>DPOGS205191-PA
MNGIKGKGPVFDPGLCRCCGAIKKCRLLNYEYESFGNKEIYSDMIMDCYGLVLSHLDGKQTERLVCASCVHRLRDAMAFRQQVLKCEEAFLQVKIYDEKENHAEEQKIEVEVKLEPEEISNETEMAKDALADESQTDNEIKPDMAINAPNEDIPLQEIVTEDEDNKLDKINVEALPKRFKTMKRLSAERSKIPMSKLQQRMRERDPTYITETNILTIVEFSYACPFKCRHNHLLCFYCGQNFSDPQQLRDHTSQFHHPRKFKITDHKNMLKLDLTRIDCRLCGHKTDDLDDFKTHVTDVHKKKYYFNVKDLMLPFKLSKDEFKCALCDVIFPYFHALNKHMNEHFSNYVCETCGLGFVDRGRFLMHQQRHEEGDFPCEVCGKVFKAQYNKELHIDRVHKKKGRVYCPKCDVRLMNYPQKLKHLVEVHGEEPLSFSCNMCDKVCETRRKLTIHRRKEHLKDYRYECQCCGQKFFTRFALTNHMPTHTGERNFKCKVCEKTYPRLKTLKDHLRIHTNDRRYRCHICGQAFIQNCSLKGHMKSQHPEYS-