Monarch geneset OGS2.0

DPOGS205195
Transcript        DPOGS205195-TA  1353 bp
Protein           DPOGS205195-PA  450 aa
Genomic position  DPSCF300265 - 366185-373671
RNAseq coverage   78x (Rank: top 65%)
Annotation
Heliconius        HMEL004546              1e-31  26.01%
Bombyx            BGIBMGA008789-TA        3e-29  33.17%
Drosophila        CG15269-PA              5e-15  28.72%
EBI UniRef50      UniRef50_UPI00016E2230  1e-16  31.90%  UPI00016E2230 related cluster n=2 Tax=Takifugu rubripes RepID=UPI00016E2230
NCBI RefSeq       XP_001607998.1          1e-16  24.22%  PREDICTED: similar to zinc finger protein [Nasonia vitripennis]
NCBI nr blastp    gi|344297759            8e-17  34.17%  PREDICTED: zinc finger protein 449-like [Loxodonta africana]
NCBI nr blastx    gi|170071169            5e-22  23.11%  conserved hypothetical protein [Culex quinquefasciatus]
Group
Gene Ontology     GO:0005515  1.2e-09  protein binding
                  GO:0003676  5.6e-08  nucleic acid binding
                  GO:0005634  1.9e-07  nucleus
                  GO:0008270  1.9e-07  zinc ion binding
                  GO:0003677  2.2e-06  DNA binding
                  GO:0006355  2.2e-06  regulation of transcription, DNA-dependent
KEGG pathway 
InterPro domain   [132-195]  IPR009057  1.2e-09  Homeodomain-like
                  [413-446]  IPR013087  5.6e-08  Zinc finger, C2H2-type/integrase, DNA-binding
                  [12-88]    IPR012934  1.9e-07  Zinc finger, AD-type
                  [140-189]  IPR012287  2.2e-06  Homeodomain-related
Orthology group   MCL30616  Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205195-TA
ATGTCAGTAGAAATCTGGAAAATTGAAAAAAATTTATGTCGCTGCTGCCATTCGGACGGTGCTTTCGATAATTTAGCTGAACCTAAACAAAATTTGGATCAAGAGGAAATATATTCTAATATGTTAAAGGAATGTTTTAATATTGAAATCATCCCAGTGCCGGGGGAGTTGTGTACGGCTACATACACGATATGCAAAGCGTGCATCTCGCGACTGCAAGACGCCTCGAGCTTCAAGAAACAGGTCTTGGACTGCGAAGGGAGATTCATGGACTTGTACCTAAACAACACTATCAAAGAATTCAAAGACAGCTTCATCAGTTTGGATGATGGCCACGATGAAGACTATGACATCAAACAAGATGAAGCCATGACAAGTGAAAAGGAAACAAAACCTAAGTTAAAAAGGAAATTTATTTCTCTACAACAAAAAATTGATATTTTGGATCAGCTAAGTAATGGCAAGAAATTAACAGCAATAGCAAAGGATCTGGAGCTAAACGAGTCGTCAATACGAACAGTTAAACAAAATGAAAGTAAAATTAGGAGTGCTGTGATGTCTAGATCGTTGCAGACCTTAAATGACCCGGATTCAGACGAACTGAACATAACACTGAAGAATAGGAAAGCTAAAGTGAAACAAAAAATGGAGAAGACGAGGAGCAGACAAAGGACAAAGCAGGACGCAAAGAAAACAAAGAGCTTGGGAACAAAGGAAAAGAAAGGAGAAGCAACGTTCAAGCTTCAAGCTAAAGATTATAAGTGGGATGATGAGACAGTGTGCTGCTGTCATTGCGGTAAGAGATACGGCAAGATGTCGTCGCTGAGGTTTCACGTTAAGACCAAACACTACAAAATACCTAAATACAGGTGCCCCGAGTGCTTGGAGGAGTTCATGACGGTGCCGCAGTTCACCGTCCACAAGCTGGAGGTCCACAACATAGACCACAGGAAACGTGATTGTTTCAAACAGCACTACAGGCAGGTCCATCTCAAACAGCGACCGAAACTGATTGGCTGTTATTATTGTGAAGAGAAGGTTTCTCCTCACATGAGGGCCTACCACTTGGAGAAAGCCCATGGAGTCCCAGCGCCATCTTGCGGCGCCTGCGGGAAGAAATTTCCTTATCCCTTCCAAGTACTGAGACACCAGAAAACTTATCACATGGGCGAGAAGAAGTTCGTCTGTAACGTCTGCGATATGACTTTCGCGTCCAGGGGCAATTTGGTCCAACACCAAGTTAAGCATTCCTCACTGAGGCCGTTCAAATGTGATTTGTGCGAAGAGACGTTCAAGTTGAAGAAGCATCTGTCTAGGCACAGATTGACGCACTTGAATCAGAGAAGATCTTAA

Protein sequence:

>DPOGS205195-PA
MSVEIWKIEKNLCRCCHSDGAFDNLAEPKQNLDQEEIYSNMLKECFNIEIIPVPGELCTATYTICKACISRLQDASSFKKQVLDCEGRFMDLYLNNTIKEFKDSFISLDDGHDEDYDIKQDEAMTSEKETKPKLKRKFISLQQKIDILDQLSNGKKLTAIAKDLELNESSIRTVKQNESKIRSAVMSRSLQTLNDPDSDELNITLKNRKAKVKQKMEKTRSRQRTKQDAKKTKSLGTKEKKGEATFKLQAKDYKWDDETVCCCHCGKRYGKMSSLRFHVKTKHYKIPKYRCPECLEEFMTVPQFTVHKLEVHNIDHRKRDCFKQHYRQVHLKQRPKLIGCYYCEEKVSPHMRAYHLEKAHGVPAPSCGACGKKFPYPFQVLRHQKTYHMGEKKFVCNVCDMTFASRGNLVQHQVKHSSLRPFKCDLCEETFKLKKHLSRHRLTHLNQRRS-