Monarch geneset OGS2.0

DPOGS215202
TranscriptDPOGS215202-TA1506 bp
ProteinDPOGS215202-PA501 aa
Genomic positionDPSCF300143 - 67234-72938
RNAseq coverage955x (Rank: top 13%)
Annotation
HeliconiusHMEL0038655e-14462.87% 
BombyxBGIBMGA008681-TA2e-10954.84% 
Drosophilaloco-PD4e-4044.50% 
EBI UniRef50UniRef50_B4GNT51e-5734.95%GL13729 n=7 Tax=Drosophila RepID=B4GNT5_DROPE
NCBI RefSeqXP_002069924.15e-5935.63%GK11781 [Drosophila willistoni]
NCBI nr blastpgi|1954445629e-5835.63%GK11781 [Drosophila willistoni]
NCBI nr blastxgi|1954445622e-5535.63%GK11781 [Drosophila willistoni]
Group
Gene OntologyGO:00048712.4e-32signal transducer activity
GO:00071653.3e-05signal transduction
GO:00050573.3e-05receptor signaling protein activity
KEGG pathwayhmg:1002018745e-18 
 K10260 (FBXW7, SEL10)maps-> Ubiquitin mediated proteolysis
InterPro domain[65-183] IPR0003422.4e-32Regulator of G protein signalling
[54-183] IPR0161372.8e-32Regulator of G protein signalling superfamily
[58-94] IPR0240663.3e-14Regulator of G-protein signaling, domain 1
Orthology groupMCL13081 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215202-TA
ATGGTCGTGTGGTATACGGACGACGACGAGAGCGAACAGTCCTCCCCGTTCCGTCGATGGACCACAGGAGGTGGCGCTGGCAGCTCTTACAGGCACCCAGACCACCGAGGTCTATACAATAAGCAAATGAGCGAAGGTTCGGCGCCAGCCCAGTCTGGTTCCGCTAAGGGCGGCGTGGCTCGCTGGTCGTTAGGCCTGGAACAGCTGCTGGCCGATCCAGCTGGGGCTGCAGCCTTCGCCCACTTCCTGGACAAGGAGTACGCCGCTGAGAACATCCGGTTCTGGTGGTCGTGCGAGCAGTACCGCCTCTGTGGGGCGGAGGCCGAGCGCTCCGCCCTCGCCACTCAGATCTGGCAGCGACACCTCGCGGACGGGGCCTCGGATCCCGTGAATGTTGACGCCGCAGCCCTGAGGGCGGTCACTCTCAGGCTGCACCAGACTCCACCGCCGCAAGATCTTTTTCTCCAGGCTCAGAAGCAGATCTTCAACGTGATGAAGTTCGACAGCTACCCTCGTTTCCTTCGCTCGTCGGTCCACGCGGAGTGCGCCCGAGCTGACCTCCGAGGCCTCCCGCCGCCCTACGCACCAGAAAACAACAAGCTGAAGAAGACTTCGTCGAACGCGTCCGAGAGGCGGCGCAGCGGGTCGCTCCTGCCCTGGAGGACGAGGGCGCTGTCCAGGGAGAGAGACGAAGACGTGGTGAAGTCGAGTCAAACGAACGGTCAGTGTTCGTTGTGCCGCGTGGTGCTCCCAGACGGTGCTACGTCGGTGGTGGGGGTGGACGAGGAGGTCACCGTCAAACGTCTCGTGGACAGACTCCTCCAGAGACGAAACTTGGACTGCAACACCTATGACGTCATACTCGTCGATCAGGCGGGTGAGGGTTCCACCGTCCCGTGTTCCGCTCCGTCGTCCCGCCTCGGAGGTCGCGTAGCGCGCGTGGAGCGCCGCGTGGTGTTACGCGTGTCCGTCGCCGGGCGGGCGGTCGCGGTCCGCTGTCGTCCATCGCGGCGCCTGCGACACGTCCTGCGCCCCGTCCTGCAACGCTACTGGCCAGATCTCGGCTCTCATCGGGTCCTCTCCGCCGGCGTCCCCATACACCCCGACACGCCAGTCGACGAGCTGGACGGGGCCAGGGTTCAGATACTAGAAGACGACACGACCAGCACGCCGCCCGTGACGATCACCCGCGACGAGGACGGAGACTCGCTCAGCGACCTGGCTCTCCGCCCCGACGACCTCGACGACAACCAGAGTACAAGTCGAAGTAGCGTCAGTTCTAACCAGACTGTGGACGTGACTTCCCTTGTGAACACGTCCGGTGTGAGCGGGGGCGGGGGCGGCGGCCCGGGCTCCAGGGTGAGAGCGGCGCTGCGGGCCGGCCCGCCACTACATCATCATCCTCCTGATTTCCTTGAAAACCTGCGCGAGACGCAGCGTCAGAGGCTCCCGGCGCGCACGCCGCCGCCGCTACCTCCTAAACCTCGAGCGCCGACTCTCGTGTGA

Protein sequence:

>DPOGS215202-PA
MVVWYTDDDESEQSSPFRRWTTGGGAGSSYRHPDHRGLYNKQMSEGSAPAQSGSAKGGVARWSLGLEQLLADPAGAAAFAHFLDKEYAAENIRFWWSCEQYRLCGAEAERSALATQIWQRHLADGASDPVNVDAAALRAVTLRLHQTPPPQDLFLQAQKQIFNVMKFDSYPRFLRSSVHAECARADLRGLPPPYAPENNKLKKTSSNASERRRSGSLLPWRTRALSRERDEDVVKSSQTNGQCSLCRVVLPDGATSVVGVDEEVTVKRLVDRLLQRRNLDCNTYDVILVDQAGEGSTVPCSAPSSRLGGRVARVERRVVLRVSVAGRAVAVRCRPSRRLRHVLRPVLQRYWPDLGSHRVLSAGVPIHPDTPVDELDGARVQILEDDTTSTPPVTITRDEDGDSLSDLALRPDDLDDNQSTSRSSVSSNQTVDVTSLVNTSGVSGGGGGGPGSRVRAALRAGPPLHHHPPDFLENLRETQRQRLPARTPPPLPPKPRAPTLV-