Monarch geneset OGS2.0

DPOGS201265
TranscriptDPOGS201265-TA1344 bp
ProteinDPOGS201265-PA447 aa
Genomic positionDPSCF300037 + 794631-796082
RNAseq coverage732x (Rank: top 18%)
Annotation
HeliconiusHMEL0121440.090.85% 
BombyxBGIBMGA008080-TA0.077.88% 
Drosophilanoc-PA2e-9145.16% 
EBI UniRef50UniRef50_F4WAL42e-11355.76%Zinc finger protein Noc n=7 Tax=Neoptera RepID=F4WAL4_ACREC
NCBI RefSeqXP_974922.13e-14461.89%PREDICTED: similar to zinc finger protein nocA [Tribolium castaneum]
NCBI nr blastpgi|910768607e-14361.89%PREDICTED: similar to zinc finger protein nocA [Tribolium castaneum]
NCBI nr blastxgi|910768602e-14861.52%PREDICTED: similar to zinc finger protein nocA [Tribolium castaneum]
Group
Gene OntologyGO:00036763.1e-06nucleic acid binding
KEGG pathway 
InterPro domain[325-356] IPR0130873.1e-06Zinc finger, C2H2-type/integrase, DNA-binding
Orthology groupMCL14631 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201265-TA
ATGGTGGTACTTGAAGACGGAGTAATGATGACTACGAATCCGAATCAATACTTGCAACCAGATTATTTAACTCCACTTCCATCAACTTTGGACTCCAAAAAAAGTCCGCTTGCGCTTTTAGCGCAAACATGCAGTCAAATAGGCGCAGATACGCTTCCCAGTAAGCCTCTTCTGCCTCCACTTGAGAAAAAGAAAACAGTGAACAGTGTTAACAGTGATGCAATTAGTCGTTCTTCACCTAGTGCGAAACTGGATAAACCTCGTTCTTCACCGGAGAGTAAACATTTAGCTTTCAAACCATATGAAACTAATGTTGTTACGAAGAAACCTGAAGAAACAAGACCGACTTCCAAAGCAAGTTCTGATATTTCTAATGATGATAAAAAGTCTGGAAAAAGTACACCAGGAAGAAAGTCAACCCCACCATCAACCGAAAATGGAAAGAGCAGTCCTTTAAACGAACAAAAATCTTCATCCGCTGGATCATCCGGTACTAGCCCGATTATTCGTTCAGGATTAGAAGTTTTAGGGCATGGCAAGGATCATCTTGGAGCTTTTAAGAATATTCCTGGATTAGCTGGATTTAATCCTTTGGCTGGATTATGCTGCCCTCCTGGAATGGAACAACATGCGAATCCAGCGTTCCGACCTCCATACGCTGGAGCTCCCTTAAGTGCACATCACGCTGCTATGCTGGCAGCTGCTGCTGGTTTTCCCGGCTCGTCACCAAATCCCTACCTTGGATATGCCAGAGTTAAAACACCAGCGGGAGGAGAGACATTAGTTCCAGTATGCAAAGACCCATATTGCACTGGCTGCCAATTTTCAGTTAACAATCACCACCTTCTTATGAGTAACGGTGCTTGTCCCGCTGGATGTACTCAATGTGACCATCAAAAATATAATTTGGCCATGGCTATGGCTCTTTCACAACAAGGGGCTGCTGCAGGCCTTCCTTACACTCAAATGAGTCGGCCTTATATTTGTAACTGGATTGTTGGAGAGTCTTATTGTGGCAAGAGATTTGGTAATTCTGAAGAGCTTTTACAACATTTAAGAAGTCATACTACAGACGGATCGACTCCAGTATCATCTACGTCCTCCCAACCGTCTCTAATGAATCCACTAAATCCTCTATTTACGACCGCTGGACTCCGCAACGCTTACCCGACGGCTCCGCTAAGTCCTTTGTCTGCAAGCAGATACCACCCGTATTCAAAAGCTGCTCTTTCAGCAAGCTTAGGAGCATCTCCTTATGGAGCTTTTAATCCTGCTCTTGGCGCTTTTTATTCGCCCTATGCAATGTATGGACAAAGAATTGGAGCAGCCGCTGTGCACCAATAA

Protein sequence:

>DPOGS201265-PA
MVVLEDGVMMTTNPNQYLQPDYLTPLPSTLDSKKSPLALLAQTCSQIGADTLPSKPLLPPLEKKKTVNSVNSDAISRSSPSAKLDKPRSSPESKHLAFKPYETNVVTKKPEETRPTSKASSDISNDDKKSGKSTPGRKSTPPSTENGKSSPLNEQKSSSAGSSGTSPIIRSGLEVLGHGKDHLGAFKNIPGLAGFNPLAGLCCPPGMEQHANPAFRPPYAGAPLSAHHAAMLAAAAGFPGSSPNPYLGYARVKTPAGGETLVPVCKDPYCTGCQFSVNNHHLLMSNGACPAGCTQCDHQKYNLAMAMALSQQGAAAGLPYTQMSRPYICNWIVGESYCGKRFGNSEELLQHLRSHTTDGSTPVSSTSSQPSLMNPLNPLFTTAGLRNAYPTAPLSPLSASRYHPYSKAALSASLGASPYGAFNPALGAFYSPYAMYGQRIGAAAVHQ-