Monarch geneset OGS2.0

DPOGS215402
TranscriptDPOGS215402-TA1182 bp
ProteinDPOGS215402-PA393 aa
Genomic positionDPSCF300088 + 344654-346412
RNAseq coverage962x (Rank: top 13%)
Annotation
HeliconiusHMEL0097156e-11388.74% 
BombyxBGIBMGA012436-TA6e-13189.76% 
DrosophilaCG11360-PA2e-5956.70% 
EBI UniRef50UniRef50_B7FC522e-9361.18%KH domain protein n=9 Tax=Endopterygota RepID=B7FC52_TRICA
NCBI RefSeqNP_001137201.13e-9461.18%mex-3 protein [Tribolium castaneum]
NCBI nr blastpgi|2195220406e-9361.18%mex-3 protein [Tribolium castaneum]
NCBI nr blastxgi|2195220401e-9961.88%mex-3 protein [Tribolium castaneum]
Group
Gene OntologyGO:00037233.2e-14RNA binding
GO:00055159.7e-05protein binding
GO:00082709.7e-05zinc ion binding
KEGG pathway 
InterPro domain[129-196] IPR0040873.2e-14K Homology
[132-191] IPR0181111.4e-12K Homology, type 1, subgroup
[296-346] IPR0130838.9e-08Zinc finger, RING/FYVE/PHD-type
Orthology groupMCL15742 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215402-TA
ATGTTTTGTAAACACATCTCATACGAGCAGAAGCCACGGCCTCCAGCCGTCGACTGCCGTATCCGCTATCCGCATGGCAACACCCGGGGGTGGGGGATTACTATGTTCACATTGACGCGGACCGAGTGTCTCGAACGACTGCAGTCTATGACATACATAGATACATACATCTATACATGGTGCAAGATCAAAGCTCTACGCGCCAAGACCAACACATACATCAAGACTCCGGTGCGCGGTGAGGAGCCCGTGTTCGTGGTGACTGGTCGCAAGGAAGACGTCGCACGCGCCAAACGCGAGATCCTCTCCGCCGCCGAGCACTTCTCGCAAATTAGAGCTTCGCGCAAATGTGGAGCTGCGCCCCCGCCCCCGGCCGGAGCGCCCGGACATGTGACCGCACAGGTGCGCGTCCCATACCGCGTCGTGGGACTGGTCGTGGGTCCCAAGGGCGCCACCATCAAACGCATCCAACACACCACTCACACGTACATCGTGACACCGTCGAGGGAACGCGAGCCCGTGTTTGAGGTGACCGGGCTCCCGGAGAGCGTGGAGGCAGCTCGCAAGGAGATCGAGGCTCACATCGCTCTTCGTACCGGAGCTGCCAACGGAGCGACCGGCGCGGGCGCGGTGGCCGGGGCGGAGGGCGAGCCCCTCGCTCAGCTGTACCGCGCTGGACTCGCGTCTTTGTTGCGTCCCGAACAGGAGGCCGCGTTTTCCTCCGCGGGATCTTGCTCGTCAGGCGGCTCCTCGGGGCGGCTCGGCGACCTGCTCGGCATCTGGTCCTCGACGGAGCGTGACGAAGGTCTCGGCGAGTCCCCGTCGTTCGAGTCCCCCGGCGCGGGCGGCGTGTGGGCGTGGGGTCCACCGCGCCCGTCGCCGGCCGCGTCTCCCGCTCGCACGTGCGGTCTCTGCTCGGAGCGCGGGGTGTCAGCGGCGCTGGTGCCGTGCGGTCACAACCTGTTCTGTTTTGAGTGCGCACAGCGGCTCGCCACGTCGGGGGCGGCGTGCCCGGCCTGTGCCTCTCCGACGCATCAAGCCATCCGCATCCTATCGCGGCGAGAGGTGGGCGCACGCCGCGCTCGCCGCACTCGCCGCCGCAGAGAGGCGCGGGAGGAGCGGCGCGGGGTCCGCGGCGACCTAGCCAATGTGTCGTGGGTAGCCAATGCTGAGTCGGAGTGA

Protein sequence:

>DPOGS215402-PA
MFCKHISYEQKPRPPAVDCRIRYPHGNTRGWGITMFTLTRTECLERLQSMTYIDTYIYTWCKIKALRAKTNTYIKTPVRGEEPVFVVTGRKEDVARAKREILSAAEHFSQIRASRKCGAAPPPPAGAPGHVTAQVRVPYRVVGLVVGPKGATIKRIQHTTHTYIVTPSREREPVFEVTGLPESVEAARKEIEAHIALRTGAANGATGAGAVAGAEGEPLAQLYRAGLASLLRPEQEAAFSSAGSCSSGGSSGRLGDLLGIWSSTERDEGLGESPSFESPGAGGVWAWGPPRPSPAASPARTCGLCSERGVSAALVPCGHNLFCFECAQRLATSGAACPACASPTHQAIRILSRREVGARRARRTRRRREAREERRGVRGDLANVSWVANAESE-