Monarch geneset OGS2.0

DPOGS212395
Transcript: DPOGS212395-TA (3069 bp)
Protein: DPOGS212395-PA (1022 aa)
Genomic position: DPSCF300019 + 1063646-1075868
RNAseq coverage: 229x (Rank: top 44%)
Annotation
Heliconius: HMEL010708, 0.0, 62.66%
Bombyx: BGIBMGA012078-TA, 6e-154, 45.76%
Drosophila: (no entry)
EBI UniRef50: UniRef50_C3Z7E0, 2e-72, 27.68%, Putative uncharacterized protein n=1 Tax=Branchiostoma floridae RepID=C3Z7E0_BRAFL
NCBI RefSeq: XP_966626.2, 6e-99, 30.52%, PREDICTED: similar to sorting nexin 14 [Tribolium castaneum]
NCBI nr blastp: gi|189235075, 1e-97, 30.52%, PREDICTED: similar to sorting nexin 14 [Tribolium castaneum]
NCBI nr blastx: gi|189235075, 1e-96, 30.51%, PREDICTED: similar to sorting nexin 14 [Tribolium castaneum]
Group
Gene Ontology:
  GO:0005515, 1.5e-09, protein binding
  GO:0007154, 1.5e-09, cell communication
  GO:0035091, 1.5e-09, phosphatidylinositol binding
  GO:0004871, 2.1e-07, signal transducer activity
KEGG pathway: (no entry)
InterPro domain:
  [326-402] IPR003114, 3.6e-15, Phox-associated domain
  [432-566] IPR016137, 7.2e-12, Regulator of G protein signalling superfamily
  [648-766] IPR001683, 1.5e-09, Phox homologous domain
  [445-564] IPR000342, 2.1e-07, Regulator of G protein signalling
Orthology group: MCL15983, Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species
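
The lengths and coordinates in this record can be cross-checked with a little arithmetic. The Python sketch below is illustrative and not part of the database: the helper name `parse_position` is hypothetical, and it assumes the genomic coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re

def parse_position(text):
    """Split a 'SCAFFOLD STRAND START-END' string into typed fields.

    Hypothetical helper; the layout is inferred from this record only.
    """
    match = re.fullmatch(r"(\S+)\s+([+-])\s+(\d+)-(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised position string: {text!r}")
    scaffold, strand, start, end = match.groups()
    return scaffold, strand, int(start), int(end)

scaffold, strand, start, end = parse_position("DPSCF300019 + 1063646-1075868")

# Genomic span of the gene model, assuming 1-based inclusive coordinates.
span = end - start + 1

# Values copied from the record above.
transcript_bp = 3069
protein_aa = 1022

# A coding sequence with a terminal stop codon has 3 * (residues + 1) bases.
cds_is_consistent = transcript_bp == 3 * (protein_aa + 1)

print(scaffold, strand, span, cds_is_consistent)
```

Under the inclusive-coordinate assumption the gene model spans 12223 bp of scaffold, about four times the 3069 bp transcript, so most of the locus would be intronic.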

Nucleotide sequence:

>DPOGS212395-TA
ATGAACATTTTCCAGACATTCTTAGATACGATATCGTGCCATTTTATGAGGAAGCTGCCCAGTCCAGATCAGAAAAAAACTTCTGTCAGAATTTGTAATGCCTGTGATGCTCCTAACTGCAGTCGCCATGATCCAGAACTATTACCCGATCCCTGGGTCGGATTACTCATTCATAAACAATTGGACCAGGCTATAGAAGATTTTTATAACAGAATTTTAGAGCAATTTATAAATACATGGTACAGCAAAATAACTTTGCAGCCATTTTTTATTGATGAACTTCGACAACAGTTGAGATATGCATCGGCATGTTTGTACAGGAGAGCTCTTACAATAGATCCAGGTGTGTTCATCTTCAGCCGCCTTTTGCCATGTGCGTTACGTCACGTGTCCCTCCCCCCGGCCGCCACTCACACCGCTCTGAGCTCGCGGGCGTCTGAACTACGCTATCTGAGATGTCTGTCTGATACACTGCTGCCATATCTCCTACGACCGGCAGATTGTCATAATGAATTTCACCTGATCACAATAATAGTAAGCTATGTGTTGGGATGTTTGGCTTGTTATTATGGACTTCAGATGAACATTTTCCAGACATTCTTAGATACGATATCGTGCCATTTTATGAGGAAGCTGCCCAGTCCGGATCAGAAAAAAACTTCTGTCAGAATTTGTAATGCCTGTGATGCTCCTAACTGCAGTCGCCATGATCCAGAACTATTACCCGATCCCTGGGTCGGATTACTCATTCATAAACAATTGGACCAGGCTATAGAAGATAATTTTAGAGCAATTTATAAATACATGGTACAGCAAAATAACTTTGCAGCCATTTTTATTGATGAACTTCGACAACAGCTGAGATATGCATCGGCATGTTTGTACAGGAGAGCTCTTACAATAGATCCAGGTGTGTTCATCTTCAGCCGCCTTTTGCCGTGTGCGTTACGTCACGTGTCCCTCCCCCCGGCCGCCACTCACACCGCTCTGAGCTCGCGGGCGTCTGAACTACGCTATCTGAGATGTCTGTCTGATACACTGCTGCCATATCTCCTACGACCGGCAGATTGTCATAATGACCATGACCCGCCTATGTGTAGAGTGTTCCGCACCTTGGTCCGTGAGCTGGTGTCGTGGTGGGTTCTTCTGCCGATGGTGGACGTGCTGGCTGACCCCTACACCCTCAACACTCTCCTGCTGCTGGTCACCGGGGATGACACCATGGCGCCGTTACCCATCACGCCAGATTATAAAGTCGAGTTCCTGGAGTGTTTCGTTCGTCAGTCGGAGTGCGTGTATTCGTCGAGGCCTCGTCTGCTTCGCGTTCAGTTGGAACCATTAGTCCAGGAGTCAGGCGCTCTATACGCGCTCCTGCAATACTTGAAAACAACACCTCATTTACAGCTCTTACAGTTCTACCAGGACATCAAATCATTTCAAACTCGAATTCTCAACCCCGAGCTGACTGTATCAGAACAGGCGTCTCTCCAGCGTGAAGCCGAGGAGTTGTTTTCGCGGTACAGAAGTAGTGGACATCACGAGCTGATACAGGAGATGGAACAACTCATGAAGGAGGGAGGAGTTAGGAGACTTCAGACATCTCGAGCGCTCTATCAGGCCGCCAGACAGGCGCACGGTGCCTTAGAGAAGACAATGCTACCAAAGTTCTTACATAGCGAAGAATTCTACAAATTATTAATAGGCCCGAGGCTGCCGGTGGGGTACCAGAAACAAATGACGAAGAGGCCGGAAGACAAACAGAGCATGTTGAAACTCGGCGTGAGGATAAAGAATGCTTTAAAGCAGCAGGTCGTGGACGGCCAGGTGTTGGAGATGACGTCACAGCTGGACGAGGGGGAGAGTATAGAGAACATAGACATCCTGCAATACCTGGACTCCCTCGCCGCCGAGGACTCCTTGGATCAGGATCTCAGCACTTACTTTCACCAGGATATCTTAATGATACTTATCTCCGTCCAGCAAGCCCCTCCGCGTCGTGG
TCCGGTGCGAGTGTTCACACTGGCGGTGCACCGCTCGGGGCGGTCCGTGGGAGCTATGGACGTTGAAGCGGCGTTGTGGCGCGTCAAGCGAAGCGAACATGACTTCCACCTGCTGCGAGCCAAGCTGAGGGAGTTCCACGGAGACGCGTTAGCGCTGCAGCAGCTGCCGTCTAGAAGAGACAATAGTCCTCTGGAGACTCTCCGCTACAAGTATGAAGACTTCCTCCAGAGACTGCTACAGATCTCCCTTCTCCAGACCAGCGAACTCCTGCATTTATTCCTTACAGTTGATGGAGATTTCTCCCTAGTGGTCCAAGCGTCGACCTTGAACGCCTCGAACACAGACCTAGGAAACATTTACCAGTCCGTGGCACATAAACTACGGAAGGAGAAGGGACAGCACCTGGAAAGCTTCCTCAGGAATTTTCTGGTGTCGTCGGATAAGGAGCGGTATCAGGCTTTAAAACAGGGTTCCCAAGTGGAAGAAGCTCATGAAGTGAACGAAGAAGATACGGAGAAGATCGTCAAGAGACAACACAACGTCCGCAGCATACAGTCCAGCGTGTTCGGGAACAACTTTGATACAGAACCGGAAGTGACGCACATTCAGACCCACTACCAGGACACCGTGGTCGGCTTCACACAGTGCTTTATGTATTTACTAATAAAAGTGTTAAAAGTCCCCGGGCTGGTGGTGGGCGTGGTGGGCAGCGTGCTATCTCTGGTGAGCGACTCGCTGGACCTCGCGGGCTCGGCCTTAACCAACAAGTACCTCAAGGAGCTGTTGAACGAGAGACGACTGGCGCATCTCATACGACTTGGACACAATCTCCTGTTCAATGACCGCACCCCCCGCAGCCCCGCGTCGCTGGTGACGTCACGGGCGCGGGCGCGGGTGGCGGGGGCGGGGAGGGGCGCGAGGCTGTGGTCGGGGGTCGTGCAGGATGTGTTCGACATGATGCAAGTACCGAGGATGAACAAACAGCTGGTTTATAATTTATTGGACCTGTGTGTGCTGGAACTGTTCCCGGAGCTGCGGACCCCGGGAGCGTCGCACGCGGACACCTGA

Protein sequence:

>DPOGS212395-PA
MNIFQTFLDTISCHFMRKLPSPDQKKTSVRICNACDAPNCSRHDPELLPDPWVGLLIHKQLDQAIEDFYNRILEQFINTWYSKITLQPFFIDELRQQLRYASACLYRRALTIDPGVFIFSRLLPCALRHVSLPPAATHTALSSRASELRYLRCLSDTLLPYLLRPADCHNEFHLITIIVSYVLGCLACYYGLQMNIFQTFLDTISCHFMRKLPSPDQKKTSVRICNACDAPNCSRHDPELLPDPWVGLLIHKQLDQAIEDNFRAIYKYMVQQNNFAAIFIDELRQQLRYASACLYRRALTIDPGVFIFSRLLPCALRHVSLPPAATHTALSSRASELRYLRCLSDTLLPYLLRPADCHNDHDPPMCRVFRTLVRELVSWWVLLPMVDVLADPYTLNTLLLLVTGDDTMAPLPITPDYKVEFLECFVRQSECVYSSRPRLLRVQLEPLVQESGALYALLQYLKTTPHLQLLQFYQDIKSFQTRILNPELTVSEQASLQREAEELFSRYRSSGHHELIQEMEQLMKEGGVRRLQTSRALYQAARQAHGALEKTMLPKFLHSEEFYKLLIGPRLPVGYQKQMTKRPEDKQSMLKLGVRIKNALKQQVVDGQVLEMTSQLDEGESIENIDILQYLDSLAAEDSLDQDLSTYFHQDILMILISVQQAPPRRGPVRVFTLAVHRSGRSVGAMDVEAALWRVKRSEHDFHLLRAKLREFHGDALALQQLPSRRDNSPLETLRYKYEDFLQRLLQISLLQTSELLHLFLTVDGDFSLVVQASTLNASNTDLGNIYQSVAHKLRKEKGQHLESFLRNFLVSSDKERYQALKQGSQVEEAHEVNEEDTEKIVKRQHNVRSIQSSVFGNNFDTEPEVTHIQTHYQDTVVGFTQCFMYLLIKVLKVPGLVVGVVGSVLSLVSDSLDLAGSALTNKYLKELLNERRLAHLIRLGHNLLFNDRTPRSPASLVTSRARARVAGAGRGARLWSGVVQDVFDMMQVPRMNKQLVYNLLDLCVLELFPELRTPGASHADT-
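
The protein sequence above can be checked against the nucleotide sequence by translating it with the standard genetic code. The following Python sketch is illustrative: the `translate` function and the short test fragments are not part of the record, and it assumes the trailing `-` in the protein entry marks the stop codon (printed here as `*`).

```python
from itertools import product

# Standard genetic code, listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

def translate(cds):
    """Translate a coding sequence codon by codon.

    Stop codons become '*'; codons with unexpected characters become 'X'.
    Trailing bases that do not fill a whole codon are ignored.
    """
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3)
    )

# First six codons and last three codons of DPOGS212395-TA.
first = translate("ATGAACATTTTCCAGACA")
last = translate("GACACCTGA")
print(first, last)
```

To check the whole record, join the wrapped sequence lines into one string before calling `translate`, then compare the result with the protein entry after replacing its final `-` with `*`.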