Monarch geneset OGS2.0

DPOGS205190
TranscriptDPOGS205190-TA1638 bp
ProteinDPOGS205190-PA545 aa
Genomic positionDPSCF300265 - 411163-415243
RNAseq coverage46x (Rank: top 71%)
Annotation
HeliconiusHMEL0134460.068.67% 
BombyxBGIBMGA008755-TA9e-15850.00% 
DrosophilaCG12299-PA4e-2732.42% 
EBI UniRef50UniRef50_Q1LVT72e-3033.74%Novel protein (Fragment) n=2 Tax=Danio rerio RepID=Q1LVT7_DANRE
NCBI RefSeqXP_001815375.15e-3125.00%PREDICTED: similar to 9630041N07Rik protein [Tribolium castaneum]
NCBI nr blastpgi|3266673944e-3430.48%PREDICTED: zinc finger protein 569-like [Danio rerio]
NCBI nr blastxgi|3266673948e-4130.20%PREDICTED: zinc finger protein 569-like [Danio rerio]
Group
Gene OntologyGO:00036762e-10nucleic acid binding
GO:00056345.9e-06nucleus
GO:00082705.9e-06zinc ion binding
GO:00056223.3e-05intracellular
KEGG pathway 
InterPro domain[476-502] IPR0130872e-10Zinc finger, C2H2-type/integrase, DNA-binding
[15-92] IPR0129345.9e-06Zinc finger, AD-type
Orthology groupMCL34583 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205190-TA
ATGTACATTATTACAAGAAAGGGCCCTCTTTACGACCCGGGGTTGTGTAGATGTTGCGGTGCAATAAAGAAATGTCGCTTACTGAATGTGGAATACGAATGGCAGGGTCGAAAAGAAATTTATTCCGATTTATTTGTGGATTGTTTTGGACTATTGCTATCACATCTAGATGGAGAGGCGAAGGAGCGTCTGATCTGCGCCACCTGTGTCTCCCGGCTCAGAGAGGCGAGTACATTTCGGCAACAGGTGCTACAATGTGAAGAAATTCTGTTACGAACCAAGATACAAAGTGAAGGTGATAGTGACTTGGATTGCGTAAAAATTGAAGCTGAAATTAAGACGGAATGCCTGATGACCTCTAATGAAGATAGCAACGATCATAATGACGTCATAGATGATATTAAGCCTGAAAATAAAAAAAGAAAGTCCGTGAGGCAGAAGTTGAAGAAGAGAAGAAAGGAGGACGATCGAGACGTGAGGGAAGAAGATCTATACAATAGCAGAAGAATCAGCGAGAGACTGAAAGAGCTCGTAAATGTCGATCTGGCTGAAAAATCAACGCAAGCCCACAACGGTTACAAATTGGACAATTCTACGCCGTACGCTAATATCGTCACGATAGTCGAAAACTCCTACGCGTGTCCCTTCGAGACGTCCTTCAGCGATTACTTCTGTGTCTATTGCAGACATCTATTCACTGACCCAAACAAACTAAGAGAACATACCCTCAATCACGATCCGCTGACCTTCAAGGAGGTCCTCTCAAACAGCGCCAACAACAAGAAAGTCCAAATAGACATGTTCCGAATAGACTGCAGGTTGTGCCCACAGGTTATAAATGATTTCGAAACGCTGAGAGATCATTTGAACAGCTCGCATCAGATAAAATTGAATCTAGTGTCGAACGAGTTCCTTAAATTTAGACTGACTTCCGGTTCGATTGCGTGCACGGAGTGCGGCACCAGCTTCAGCTTCTTCCACGCCTTGAAGAAGCACATGGCGGAGCACTTCGGGACGTGCATATGCGATGTATGCGGCGCGCATTACTTCGAAGAGCGCATGCTAGTGTTGCACCAGAAAACCCACCAAAAGAACGAGGAGTGCTTCACTTGCAAGGAGTGCGGGAAGAATTTCAAGTCCAAATACTCGAGATACATCCACATAGCGAGGCTCCACAAGAAGGAAGCGGCTTACCAGTGCAGCAAGTGCGATGAAGTGTTCTTCTCGTACAGCTTGAGATACCGGCACATGATAGACGTCCACGGCGAGGAGAGGACCTTCCAATGCGAGCAATGCGACCGAGCTTACGACAGCAGGAAGTCCTTGCGGGAGCACAACAGACGTTTCCATCTCAAAATCCTCAAACATCAGTGCGAGTTGTGCGACAAAAGATTCTATCTGCCGTCGAGACTGAAAGAGCACATGGCCAGCCACACCGGAGAGAGGAACTTCCGCTGCGAGTACTGCGGGAAGAGCTACCCGAGGCTGCGAGGTCTGAAGGTCCACATGCAGTCGCACAGCAGCGACAAGAAATTCAAATGCGTTATGTGCGAGGCGTCCTTCACCCAGAACGTGAATCTGAAAAATCATATCAAGAGACAGCACCAAAGCCTGGAATTAGACGATTACAACGACTGA

Protein sequence:

>DPOGS205190-PA
MYIITRKGPLYDPGLCRCCGAIKKCRLLNVEYEWQGRKEIYSDLFVDCFGLLLSHLDGEAKERLICATCVSRLREASTFRQQVLQCEEILLRTKIQSEGDSDLDCVKIEAEIKTECLMTSNEDSNDHNDVIDDIKPENKKRKSVRQKLKKRRKEDDRDVREEDLYNSRRISERLKELVNVDLAEKSTQAHNGYKLDNSTPYANIVTIVENSYACPFETSFSDYFCVYCRHLFTDPNKLREHTLNHDPLTFKEVLSNSANNKKVQIDMFRIDCRLCPQVINDFETLRDHLNSSHQIKLNLVSNEFLKFRLTSGSIACTECGTSFSFFHALKKHMAEHFGTCICDVCGAHYFEERMLVLHQKTHQKNEECFTCKECGKNFKSKYSRYIHIARLHKKEAAYQCSKCDEVFFSYSLRYRHMIDVHGEERTFQCEQCDRAYDSRKSLREHNRRFHLKILKHQCELCDKRFYLPSRLKEHMASHTGERNFRCEYCGKSYPRLRGLKVHMQSHSSDKKFKCVMCEASFTQNVNLKNHIKRQHQSLELDDYND-