Monarch geneset OGS2.0

DPOGS216042
TranscriptDPOGS216042-TA1155 bp
ProteinDPOGS216042-PA384 aa
Genomic positionDPSCF300067 - 293588-297828
RNAseq coverage405x (Rank: top 30%)
Annotation
HeliconiusHMEL0082223e-2436.57% 
BombyxBGIBMGA009020-TA4e-16576.26% 
DrosophilaIng3-PA1e-5061.07% 
EBI UniRef50UniRef50_D6WBV74e-10655.66%Putative uncharacterized protein n=4 Tax=Pancrustacea RepID=D6WBV7_TRICA
NCBI RefSeqXP_969965.13e-10454.06%PREDICTED: similar to inhibitor of growth family, member 3 [Tribolium castaneum]
NCBI nr blastpgi|2700014161e-10555.66%hypothetical protein TcasGA2_TC000235 [Tribolium castaneum]
NCBI nr blastxgi|3838609074e-11155.40%PREDICTED: inhibitor of growth protein 3-like [Megachile rotundata]
Group
Gene OntologyGO:00055158.4e-12protein binding
GO:00082708.4e-12zinc ion binding
KEGG pathway 
InterPro domain[289-380] IPR0110114.3e-23Zinc finger, FYVE/PHD-type
[315-376] IPR0130832.6e-22Zinc finger, RING/FYVE/PHD-type
[327-372] IPR0019658.4e-12Zinc finger, PHD-type
[327-373] IPR0197873.8e-10Zinc finger, PHD-finger
Orthology groupMCL11147 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS216042-TA
ATGTTATATCTTGAAGATTATCTAGAAATGATAGAGCACTTACCTCAAGAATTAAGAGATAGGTTTACAGAAATGCGAGAAATGGATTTATCCGTTCAAAATAACATGGACACATTAGAAAAACGTGTTCGCACACTTTTTGGTAGTTGTCGCCGAGGTGAAGTTAATACTGACCAAGCAAACACAGAATTTTCTGACATTAAACGGGGTTATAACAAAACACTTGAAGAGGCAGATGAAAAAGTCACATTAGCGAACCAGATGTATGATTTGGTGGATAGATATTTACGGAGACTAGATACTGAACTTCATAAATTCAAGTGTGAACTGGAGGCAGATAATAAAGGAATAACAGAGTTACTGGAAAAGAGATCCTTGGATCTTGATACAAATATCAATCATACATCCACATCTAACAATAACCACTACAAGGATAACAAATATCGTATTCGTGCTGAGAAAAGACGAGAAGCAAATTGGGCTCCCCGCGACGCTAGGGCACATACTAGCAACGGAAACCATTCACGTACTGACACAGCTTTACAGGCAGCCCTTGGTCGTGATTCCTATTCTCTTGGTCATGCTGGTAGTACTATAGCGGCAGCAGCAAGTCAAGCTATTGCTGCTACACAACAGATGCAACAAGGTCGTCGCACGGCTTCCCTGAAGGCGTCATACGAGGCGGTCGCTAGTGAACTGGCGCATCACGCACATCATGACCACGGTCAGGCATTAGCTTCACACTCTCAAAGCGCTTCTCACGCACACTCCACTACAACGGCTGTGGCTAACAAGCGGCATAACAAGCAAAAGAAGAATTCCTACAGTTCTAGCAACGTGTCTCTTCACACTCAACCAACGGCTGCAGTCGCAAGTCGTTCATCGTCGCCAACTGTGGGTATAAACACTGTGGTCAACAACGTTGCTATCGAAGAGCCCATGGAAGAAGAATGGACTTACGACCCGAACGAACCACGCTATTGTATCTGCAACCAAGTGTCGTACGGTGATATGGTCGCCTGTGACAACCAAGACTGTCCTTACGAGTGGTTCCACTATCCCTGTGTTGGTATCACAGCACCTCCTAAGGGTAAATGGTACTGTCCCCAATGCCAGACCAATATGAGAAGAAATCGTGCTCATCGCAAGAACTAA

Protein sequence:

>DPOGS216042-PA
MLYLEDYLEMIEHLPQELRDRFTEMREMDLSVQNNMDTLEKRVRTLFGSCRRGEVNTDQANTEFSDIKRGYNKTLEEADEKVTLANQMYDLVDRYLRRLDTELHKFKCELEADNKGITELLEKRSLDLDTNINHTSTSNNNHYKDNKYRIRAEKRREANWAPRDARAHTSNGNHSRTDTALQAALGRDSYSLGHAGSTIAAAASQAIAATQQMQQGRRTASLKASYEAVASELAHHAHHDHGQALASHSQSASHAHSTTTAVANKRHNKQKKNSYSSSNVSLHTQPTAAVASRSSSPTVGINTVVNNVAIEEPMEEEWTYDPNEPRYCICNQVSYGDMVACDNQDCPYEWFHYPCVGITAPPKGKWYCPQCQTNMRRNRAHRKN-