Monarch geneset OGS2.0

DPOGS213479
TranscriptDPOGS213479-TA1338 bp
ProteinDPOGS213479-PA445 aa
Genomic positionDPSCF300100 + 49587-51071
RNAseq coverage42x (Rank: top 72%)
Annotation
HeliconiusHMEL0129271e-2331.67% 
BombyxBGIBMGA004361-TA1e-14959.21% 
Drosophilapita-PA8e-2326.95% 
EBI UniRef50UniRef50_D6WQR37e-5243.27%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WQR3_TRICA
NCBI RefSeqXP_975272.11e-5243.27%PREDICTED: similar to lost on transformation protein 1 [Tribolium castaneum]
NCBI nr blastpgi|910871392e-5143.27%PREDICTED: similar to lost on transformation protein 1 [Tribolium castaneum]
NCBI nr blastxgi|910871393e-5535.89%PREDICTED: similar to lost on transformation protein 1 [Tribolium castaneum]
Group
Gene OntologyGO:00036765.3e-10nucleic acid binding
KEGG pathwaymcc:7045293e-24 
 K12463 (PRDM4)maps-> Neurotrophin signaling pathway
InterPro domain[130-154] IPR0130875.3e-10Zinc finger, C2H2-type/integrase, DNA-binding
Orthology groupMCL25012 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213479-TA
ATGGCCAGTCCGCCACCGAACCCTAGAGCAGGTGCGGAGAGCGGCTCCCAGTTCACGGAGGAAACGGGCGGCCCACGCGCGGCCGCCCCGCAGAAGAAGTTCATAGCACCGCCTGCCTCGCAGCTGCCAACCAAGTTGGAGTACGTCGCTCGCGGAGGAACGGGCCGCAGCGTCAAGCACACCTTCCGAACAGTCAAGTTCGCTCGCAGAGTGCCCACGCTGCCTCCCAAGAAGCCCGAAGGTGGGGCGGGCGGCGCGGGTGAGGCGAGCGGCAGCGGCGCCTCGGCCGAGCGCCGCCCGCGTCGCGACCGCAGCAAACGGCATGTCTGCACCACTTGCGACAAACGCTTCTCCAGCCCCGGCAAGCTGAGTCAACACGTTCTCTCTCATACTGGTGAGCTCCCGTTTTCGTGTGATTTATGTGACAAGCGCTTCAATTCTAAATTTAAACTAGTGCGTCACAGTCTCATCCACAGTGAGTCTAGAGCCTTCGCGTGTACCGTCTGCGCGGAGGAAGGCAATTTACAGTGCAAGATTTGTGATGAAGTGTTCAATTCGAGACAAGAAATTGTTAACCATCTTAAAGTTCATACAGGGAGTCGAGCGCCCAAAAGTGACACTGATAAGAAGTTTACCTGTGATCATTGTGACAGAAGGTTTTTTACAGCAAAAGATGTTAGACGACATCTTGTGGTTCACACAGGAAGGAGAGATTTTTTATGCCCTCACTGCCCACAAAAGTTTGGCCGTAAAGATCACCTGGTCCGTCATGTTAAGAATGCACATCCCGAGGAGTCATGGAAATCAGCAGCTGTGGGTACATCCAAAGACCCACCGCCAGAAGCTACCTCATTTGAAGAAACTTATACAGAATATAACATTGAAGAAACAGATTTCAACCTTTGGAAGACCATAACTCCTAAAGAAGAAGTACCAGAGGGACGGACAGAGGTCCCGGCTTCTGATATAATAGTAGAAATACCAGATTTAGGGATCAAAGTGGAGCCTTTAGATATAAAGTTAGAAAATCCTCAATCTCCCAGTGAAATACTCGAATATCCAGTAGTTTATATGACCGACTTGCCATATATAACACAACCAATTCGAGATCCTATTGATGTGCATTTACTTAGTTCTGGCAACGTTCAATCAATATTGTTGGACCCTGGCGAGGGTCCTTCGGGATTGTCGAGTCAGATGTTGGGGCTGTTGGAGGAGGGTGAACCTTCTTATCCTAGCGACGAGGGACGGGTGCAGCAGAGGCTGCCGGCTTTCACGCAGGCTTTTCAGACCGCCCAGAGCCCTAAGCCTCCCCCGCCCCCGCCCCCGCCGCACTAA

Protein sequence:

>DPOGS213479-PA
MASPPPNPRAGAESGSQFTEETGGPRAAAPQKKFIAPPASQLPTKLEYVARGGTGRSVKHTFRTVKFARRVPTLPPKKPEGGAGGAGEASGSGASAERRPRRDRSKRHVCTTCDKRFSSPGKLSQHVLSHTGELPFSCDLCDKRFNSKFKLVRHSLIHSESRAFACTVCAEEGNLQCKICDEVFNSRQEIVNHLKVHTGSRAPKSDTDKKFTCDHCDRRFFTAKDVRRHLVVHTGRRDFLCPHCPQKFGRKDHLVRHVKNAHPEESWKSAAVGTSKDPPPEATSFEETYTEYNIEETDFNLWKTITPKEEVPEGRTEVPASDIIVEIPDLGIKVEPLDIKLENPQSPSEILEYPVVYMTDLPYITQPIRDPIDVHLLSSGNVQSILLDPGEGPSGLSSQMLGLLEEGEPSYPSDEGRVQQRLPAFTQAFQTAQSPKPPPPPPPPH-