Monarch geneset OGS2.0

DPOGS210058
Transcript: DPOGS210058-TA, 1347 bp
Protein: DPOGS210058-PA, 448 aa
Genomic position: DPSCF300017 - 915452-919554
RNAseq coverage: 55x (Rank: top 69%)
Annotation
Heliconius: HMEL013359 | 0.0 | 96.50%
Bombyx: BGIBMGA012698-TA | 0.0 | 95.79%
Drosophila: CG9973-PB | 2e-37 | 61.60%
EBI UniRef50: UniRef50_D6WJ43 | 2e-84 | 47.28% | Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WJ43_TRICA
NCBI RefSeq: XP_002423435.1 | 5e-62 | 37.68% | hypothetical protein Phum_PHUM058930 [Pediculus humanus corporis]
NCBI nr blastp: gi|270007244 | 6e-84 | 47.28% | hypothetical protein TcasGA2_TC013796 [Tribolium castaneum]
NCBI nr blastx: gi|270007244 | 1e-86 | 49.34% | hypothetical protein TcasGA2_TC013796 [Tribolium castaneum]
Group
Gene Ontology: GO:0003677 | 1.8e-06 | DNA binding
Gene Ontology: GO:0008270 | 1.8e-06 | zinc ion binding
KEGG pathway:
InterPro domain: [382-421] IPR002857 | 1.8e-06 | Zinc finger, CXXC-type
Orthology group: MCL17423 | Insect specific

Nucleotide sequence:

>DPOGS210058-TA
ATGAGCGATACGCTCAGGAGCGAGGCGGGCGCGGATAGCGCGCACTTACCGCCCTTCTCGACCTTCGGTGAAATGGCGGAGAACGAGCAGAGGATCCTGACGGCCGACGCCAGGCTGCTGGATCCCGCGTGGGAGTACTACGAGAGGACCGGGGACACAGTCTCCGTCATCGCCAGCCAGCCGCAGTACCGCCCCTGGGAGTCGATGCCTATCACGGTCAATTCCAAGGATGCAATACTACGGGCGGGTTTCTCATCTCCTCTCGAATACCAGTCGATTACGTTACAACCGATACCAAACAAGCTACCGTCCTTCCAAAGCCAATTTCAGACGTTCCCAGAGACCACAGTCATACCGGAGACTGGTCTGCCGAGTGTGACGCCAGTTCCTGTCACCACGAGCCCCACACCCAGCGCGAGTCCTAGTCAGTTAACCCAGCTCACGCAACTAACGACACCTTCATCGCCAGCACATTTAACTACCTTGGCCCAGGTGGCGCCCTTGTCCAGTACTTTGACTACGCTTTCACCGGTCAATGCAACGACATTCCACACTCTCACCGCTGTGAACGCTCGGAGTTACCCAATCGTTCCGGCGCCCTTACAAGCCAGAGAGCTAGCACCGACAGGCCAAGCGTACATTGACGACCGACACATACAGCTTTACCAACCGAATATTGCAACCATTAACGCATTTCCGACACAAAATGGTATACTGCATCAGAACGGGGCGCTTCTACACCAGAACGGTAGCTTAATACAGAACATTCAAAGTCCAACAGTCGTTCATGTATTGAAAAACGAGCCGTTCGATATGAAATCATTACAAGACAAGTACACGCCGAACGGGCTGCATCACAGTAATTTTCAAAATCCAATGTTAATTGATAATAGCTACGAGAAGAAAGTGAACGGTTTCGGGAGTGGTTCATCGCCGACCAGGTCGGACTTCAGGAAAAAGGAGAGACGGAAAATGAGAGCGAATAGCTCGGAATCAGACTGTTCTAATATGGAGATGGGTTCAGAGAGTAGTGGACAGGTGGCAGCGGTGTCATCCACAGCAGGGTTCAAGTCCCCGATGCACGGCGCGCCGCCAATGAACACGGGACCCATGGAACTCGACGACATATCCAGCGAAAAACAGACTAAAAAGAAAAGAAAGAGATGTGGCGAGTGTATAGGCTGCCAACGAAAAGATAACTGTGGTGACTGCGCTCCGTGCAGGAACGACAAGTCACATCAGATATGCAAGCAAAGAAGATGCGAGAAGCTGACGGAGAAGAAGAATTATTTCACCCAACGACATATAATCGACTTTGACAGCTCAGTGACGCTGTCAAAGTACTGA

Protein sequence:

>DPOGS210058-PA
MSDTLRSEAGADSAHLPPFSTFGEMAENEQRILTADARLLDPAWEYYERTGDTVSVIASQPQYRPWESMPITVNSKDAILRAGFSSPLEYQSITLQPIPNKLPSFQSQFQTFPETTVIPETGLPSVTPVPVTTSPTPSASPSQLTQLTQLTTPSSPAHLTTLAQVAPLSSTLTTLSPVNATTFHTLTAVNARSYPIVPAPLQARELAPTGQAYIDDRHIQLYQPNIATINAFPTQNGILHQNGALLHQNGSLIQNIQSPTVVHVLKNEPFDMKSLQDKYTPNGLHHSNFQNPMLIDNSYEKKVNGFGSGSSPTRSDFRKKERRKMRANSSESDCSNMEMGSESSGQVAAVSSTAGFKSPMHGAPPMNTGPMELDDISSEKQTKKKRKRCGECIGCQRKDNCGDCAPCRNDKSHQICKQRRCEKLTEKKNYFTQRHIIDFDSSVTLSKY-
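
The record lists a 1347 bp transcript and a 448 aa protein, which is consistent with 448 codons plus one stop codon (449 x 3 = 1347). A minimal sketch for checking that a coding sequence translates to the listed protein is given here. It assumes the standard genetic code and follows this record's convention of writing the stop codon as "-". The function name `translate` and the table name `CODON_TABLE` are illustrative and not part of the database.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as `stop_symbol` (the record uses "-").
    Codons containing characters other than T, C, A, G become "X".
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


if __name__ == "__main__":
    # First six and last three codons of DPOGS210058-TA.
    print(translate("ATGAGCGATACGCTCAGG"))  # MSDTLR
    print(translate("AAGTACTGA"))           # KY-
```

To check the whole entry, pass the full DPOGS210058-TA sequence to `translate` and compare the result with the DPOGS210058-PA sequence, including the trailing "-".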