Monarch geneset OGS2.0

DPOGS216108
Transcript: DPOGS216108-TA, 840 bp
Protein: DPOGS216108-PA, 279 aa
Genomic position: DPSCF300182 - 152427-154594
RNAseq coverage: 529x (Rank: top 24%)
Annotation
Heliconius: HMEL020723  4e-54  73.66%
Bombyx: BGIBMGA009225-TA  1e-115  76.82%
Drosophila: Sbf-PA  6e-79  51.74%
EBI UniRef50: UniRef50_E2ADV6  2e-100  59.04%  Myotubularin-related protein 13 n=8 Tax=Formicidae RepID=E2ADV6_CAMFO
NCBI RefSeq: XP_394363.3  5e-100  59.39%  PREDICTED: similar to SET domain binding factor CG6939-PB, isoform B isoform 1 [Apis mellifera]
NCBI nr blastp: gi|350415868  3e-100  59.73%  PREDICTED: myotubularin-related protein 5-like isoform 1 [Bombus impatiens]
NCBI nr blastx: gi|350415868  5e-99  59.73%  PREDICTED: myotubularin-related protein 5-like isoform 1 [Bombus impatiens]
Group
Gene Ontology: GO:0005515  3.7e-23  protein binding
               GO:0035556  1.1e-11  intracellular signal transduction
KEGG pathway: ptr:470348  5e-09
              K12362 (RASGRP3) maps to: MAPK signaling pathway
                                        B cell receptor signaling pathway
InterPro domain: [174-278]  IPR011993  3.7e-23  Pleckstrin homology-type
                 [176-280]  IPR001849  4.2e-16  Pleckstrin homology domain
                 [85-134]   IPR002219  1.1e-11  Protein kinase C-like, phorbol ester/diacylglycerol binding
Orthology group: MCL35091  Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS216108-TA
ATGCCGGACGCTTTCACCGGTCTTTTGGAGCGACTGAGACAGCTAGAGCTGGAGCTGGGACACCTGCCGCAGAAGTGGAGGGTCAGGTGGGAACAGCTGGAACTACCCAATACACTGGAGTTACAGCGAAATTCTTCGATGTGTAGCGGAGTGGTGAGACGGGCCACGGCGGCCGCTCACAGGCGGTGCACGCGGGAGGTGCTCGCTCGCGGTCGGCTGGCGGCTGCGCCCCCCGCGGGTGCTGCGCCCCTTCACCGCTGGGCGCCCGCGCCCGCCGCCGCGCCCGCTAGATGCGACCACTGCTCAGATCTGCTATGGGGTCCCCTAGAGACGGGTGTCCGGTGTGTGGACTGTGGCGCGGCGTGTCACGAGCGCTGTGCGGAGGCTCTGGCGCTGTCCTGTACACGGTACAAGGCGCCGCCACCTGACAGAGAGAGGGACAGGCCCGTGGGAGCTGGTGTGGCGACCCTGCAGCCCTCGTCCCAGCAGTGTTACGAACAGTTCTCCAGTAACGTTGCCGAGAATAGGACGCACGAAGGACATCTGTACAAGAGGGGAGCTCTGCTCAAGGGGTGGAAACAGAGATGGTTCGTGCTGGACTCCATAAAGCATCAGCTGCGCTACTACGACGCTATGGAGGATTCGCATTGCAAGGGGTTTATAGAGCTGTCGGAGGTGCTCACGGTCTCCCCCGCGCCGGCCGCGCCCGGCCCGCCCAAGAAGTGTGACGACCGCTCGTTCTTTGACCTTCGTACGAGCAGACGGACGTACAACTTCTGCGCGAGCGACGCCAATGCAGCGCAGGAGTGGATCGAGAAGCTGCAGGGCTGCTTACAGTGA

Protein sequence:

>DPOGS216108-PA
MPDAFTGLLERLRQLELELGHLPQKWRVRWEQLELPNTLELQRNSSMCSGVVRRATAAAHRRCTREVLARGRLAAAPPAGAAPLHRWAPAPAAAPARCDHCSDLLWGPLETGVRCVDCGAACHERCAEALALSCTRYKAPPPDRERDRPVGAGVATLQPSSQQCYEQFSSNVAENRTHEGHLYKRGALLKGWKQRWFVLDSIKHQLRYYDAMEDSHCKGFIELSEVLTVSPAPAAPGPPKKCDDRSFFDLRTSRRTYNFCASDANAAQEWIEKLQGCLQ-
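
The two sequences above can be cross-checked against each other: an 840 bp transcript corresponds to 279 codons plus one stop codon, and the protein record marks that stop with a trailing "-". The sketch below is a minimal illustration, assuming the standard nuclear genetic code; `translate` is a hypothetical helper written for this example, not part of any database tooling. To keep it short, it is run only on the first and last codons of DPOGS216108-TA, but the full sequence can be passed the same way.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (assumption: this gene uses the standard nuclear code).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon (hypothetical helper).

    Stop codons are written as `stop_symbol`, matching the trailing "-"
    used in the protein record above.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First six and last five codons of DPOGS216108-TA, copied from the record.
print(translate("ATGCCGGACGCTTTCACC"))  # MPDAFT
print(translate("GGCTGCTTACAGTGA"))     # GCLQ-

# Length check: 840 bp minus the 3 bp stop codon gives 279 residues.
print((840 - 3) // 3)                   # 279
```

Both fragments agree with the start (MPDAFT) and end (GCLQ-) of DPOGS216108-PA, and the length arithmetic matches the 279 aa stated in the record.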