Monarch geneset OGS2.0

DPOGS207012
TranscriptDPOGS207012-TA2157 bp
ProteinDPOGS207012-PA718 aa
Genomic positionDPSCF300001 + 1152427-1156376
RNAseq coverage159x (Rank: top 52%)
Annotation
HeliconiusHMEL0154670.056.12% 
BombyxBGIBMGA012937-TA7e-5142.33% 
DrosophilaCG7372-PA1e-2030.41% 
EBI UniRef50UniRef50_UPI00020277D03e-3223.11%UPI00020277D0 related cluster n=1 Tax=unknown RepID=UPI00020277D0
NCBI RefSeqXP_002733160.16e-3625.61%PREDICTED: zinc finger protein 197-like [Saccoglossus kowalevskii]
NCBI nr blastpgi|2608052022e-3525.73%hypothetical protein BRAFLDRAFT_80515 [Branchiostoma floridae]
NCBI nr blastxgi|3504133531e-4022.49%PREDICTED: hypothetical protein LOC100748408 [Bombus impatiens]
Group
Gene OntologyGO:00036765.5e-10nucleic acid binding
GO:00082707.4e-06zinc ion binding
GO:00056227.4e-06intracellular
KEGG pathway 
InterPro domain[175-204] IPR0130875.5e-10Zinc finger, C2H2-type/integrase, DNA-binding
[185-207] IPR0070877.4e-06Zinc finger, C2H2
Orthology groupMCL26020 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207012-TA
ATGGAACCCAACATTTCATTCTTCAGTCTTCCATTTGTGGATTCCGAATCCACTGATTTGGTATTGAAGAAGACTCAAGATGGTGAATATGATTTCACAAATGACCTGCCTTCGCATGGTTCATTATTGGGCACTGACGATGTTTTGGCTCAGTTTCTATCCGCTGATGGACCTCTCGAGGAACCCGACCTTAACGGACCTACAACATTGCACTGTGAGATATGCAGTAAGAAGTTTGATAATGCCAAGAAGTATTATGGTCACTTGAGAGTACATTCTAAGGATAATCTTTGGATTTGCGACAAGTGTCCAGACCAGAAATTTGCTGTAAAACAACAGCTGATGAAGCACAGCTTAATACACAAGCCACTCGCCAGAGTTTGGAAGTGTCCGCAGTGCTCTATGGCATTTGAGGCCTTGTGGCGTTTGCAGCAACATCTTTTTGCTAAGCACTTGGATTATAGGCCCCACAAGTGTGAGCAATGTGACAAATGCTTTCATAAACCATCAGATTTAAGAGAACATTTGGACACCCATAAAGATATTAAGGCCCATGTCTGTCCGAACTGTGGAAGGGGTTTCAGTCATAAATCGAACTTGAAAAGACATATGATATTGCACGGAAAGGAGAAGCCTTTCAGTTGCATCGGATGTCAAACTAGGTTCACACAGGTAGCATCACTGAGACGTCATCAGAAGAATTGCTCTCAATTCAAATCCCAGAATCCAGGTCAACCACAGGACAAAACTAGGAAGAATTACTGCAGGGTTTGTGGGGCCGCTTTTCAATATAAAAGTTCCTTGCTGGAACATTGTATAAGACAACACACACCTCATAAAGAAGAGAAAGAAAAGAGTACTCGTAGTAAAGACGCAACAAGAGCCGAGGACAATATTGTTGATGATATATTATCTGTCGAGGATGATTACATGACTATGTCGAACCATCAAAGCGGTCTAAGTTACAACCATCAAGTAACGAACGATTCTGCCTCATACGACAACTTGATGCAAATAGAATTCCTTAAAGAAATGAATCAACTACACATCCTGGACGATGAACTGCTCTACAACGACATAGATTTCGATAGCTTCCAAAATAATCAAATATTTAATATAAACCCGGGCGATATAGATTGTGGTAACGACAAAAGTGAAGTCATGTTCGATTTCACCGATACAAGCAAGGGTGAGGATCAAGATATAATGAACGCGATATACCAAGTCAAAGCGGAACTATTACCGGATGAGCTCTTGAACGTGGAGACCGTAAACGGTAATAAACATCAGGAATTACAGAACGCACAGGTCACTGTGAACGAATGTGCCACTATATTCGAGAGTGACGTAGATTTAGAGGCAAGTAGTAATTTGGCTGCGAACTTAAATCAGCTGATCGGTGAAAACAGTGTTCAGTACATTTCCACGGAAGACGACGATATGTTCATCATAAGTCTAAACAGTGAGATCGATGCTGAACAGCTGACGGACATGTTGAACATAGGCGTCGGTGTAGTGAGCGGTAAAGTGGAAGAGGCCACGGAAATAAAAACCAAAGAATTAGACAAGAGCAAAAGTTTGCTGGACAACATACAACCGATCATATTGAAGATCGAACAGTCACAAGCTCCTAACACTGGTGATGGAGATACAAACGAAAAGGAGAACAAGCAAGGCAGGAAGTTGAAGCGGAAACGGGTACTGTATGTTTGTTCCACCTGTGACAAAGTATTCAAGAAGAGAGACAACTTTAGATCTCACATAGGTACTCACGAGGCTTCTCTCCGCCGTCACGTGTGTCCGGTGTGCGGTGACCGGTTCAGTTATCGTTCAACGCTCAACAAACACAGACGGGCCACGCACACACCGCACGTGTCTGAGAGCCACGAGTGTACGCGCTGCGAGCGATCATTCACCGCAGCTTGGATGTTGAAAAATCACATTGAACGTGATCACGATCGCCTGACTCCTTACGCTTGTGATTACGAAGAATGTGACAGGAAGTTTTTCAAAAAATGCGATCTGGTTGTACATAAGAGGCAACACACGGGGGAGAGACCATACACCTGTGAGATCTGCAAACAGAAGTTTCTTCACAGTTCCCATCTCAAAAGACACGAGCGTGGAGTCGACTGCGCCAAAAGGCGAAAAAGATAA

Protein sequence:

>DPOGS207012-PA
MEPNISFFSLPFVDSESTDLVLKKTQDGEYDFTNDLPSHGSLLGTDDVLAQFLSADGPLEEPDLNGPTTLHCEICSKKFDNAKKYYGHLRVHSKDNLWICDKCPDQKFAVKQQLMKHSLIHKPLARVWKCPQCSMAFEALWRLQQHLFAKHLDYRPHKCEQCDKCFHKPSDLREHLDTHKDIKAHVCPNCGRGFSHKSNLKRHMILHGKEKPFSCIGCQTRFTQVASLRRHQKNCSQFKSQNPGQPQDKTRKNYCRVCGAAFQYKSSLLEHCIRQHTPHKEEKEKSTRSKDATRAEDNIVDDILSVEDDYMTMSNHQSGLSYNHQVTNDSASYDNLMQIEFLKEMNQLHILDDELLYNDIDFDSFQNNQIFNINPGDIDCGNDKSEVMFDFTDTSKGEDQDIMNAIYQVKAELLPDELLNVETVNGNKHQELQNAQVTVNECATIFESDVDLEASSNLAANLNQLIGENSVQYISTEDDDMFIISLNSEIDAEQLTDMLNIGVGVVSGKVEEATEIKTKELDKSKSLLDNIQPIILKIEQSQAPNTGDGDTNEKENKQGRKLKRKRVLYVCSTCDKVFKKRDNFRSHIGTHEASLRRHVCPVCGDRFSYRSTLNKHRRATHTPHVSESHECTRCERSFTAAWMLKNHIERDHDRLTPYACDYEECDRKFFKKCDLVVHKRQHTGERPYTCEICKQKFLHSSHLKRHERGVDCAKRRKR-