Monarch geneset OGS2.0

DPOGS212134
Transcript: DPOGS212134-TA; 3093 bp
Protein: DPOGS212134-PA; 1030 aa
Genomic position: DPSCF300038 + 47682-63426
RNAseq coverage: 364x (Rank: top 33%)
Annotation
Heliconius: HMEL004985; 0.0; 70.26%
Bombyx: BGIBMGA006584-TA; 0.0; 75.93%
Drosophila: CG42450-PA; 0.0; 60.45%
EBI UniRef50: UniRef50_E0W0J6; 0.0; 58.48%; Regulator of G-protein signaling, putative n=2 Tax=Neoptera RepID=E0W0J6_PEDHC
NCBI RefSeq: XP_001602584.1; 0.0; 62.79%; PREDICTED: similar to regulator of g protein signaling [Nasonia vitripennis]
NCBI nr blastp: gi|383857974; 0.0; 61.51%; PREDICTED: regulator of G-protein signaling 7-like isoform 1 [Megachile rotundata]
NCBI nr blastx: gi|345482134; 0.0; 43.76%; PREDICTED: hypothetical protein LOC100118674 [Nasonia vitripennis]
Group
Gene Ontology:
  GO:0004871; 3.3e-42; signal transducer activity
  GO:0007186; 3e-18; G-protein coupled receptor protein signaling pathway
  GO:0005834; 3e-18; heterotrimeric G-protein complex
  GO:0035556; 9e-15; intracellular signal transduction
KEGG pathway: oaa:100078978; 7e-100
  K13765 (RGS9); maps to Phototransduction
InterPro domain:
  [412-528] IPR000342; 3.3e-42; Regulator of G protein signalling
  [405-539] IPR016137; 3.1e-37; Regulator of G protein signalling superfamily
  [112-204] IPR011991; 7.3e-19; Winged helix-turn-helix transcription repressor DNA-binding
  [332-393] IPR015898; 3e-18; G-protein gamma domain
  [401-443] IPR024066; 1e-15; Regulator of G-protein signaling, domain 1
  [124-202] IPR000591; 9e-15; DEP domain
Orthology group: MCL13084 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212134-TA
ATGACATCGGTAGATGAATTCGTAAACTCATCGGCATGGCTTATCAAGGTATTGATAGTGATAATCGTGATAGTAAGTGTGTGCGCAGATGTTTTTAACATTACGTATGTACATTTAGACCGTCCGTCAAAGAGATCGCGTCCCCACTATAAGCTTGCTCTAACGAAAGCGATAAAGGTGTCGGGGGCGCAGCGTGGGCGCGAGCGGCATGCGGCCCCCGCGACAGACTCCCATGGAGGTAGGATAGCGCGCGGCCAGCCGCGCCCACTGCCCCTGCTCACCGCGCTCGCCATGGAACCCGCGGATGTGCCGAGGCACAGACCCCTCGCATTCGATAAGATGGAAAGCTTGATTAAAGAAATGCAAGATCCGGACACGGGTGTACCAGTTCGAAGTCAAAAACTTTTTCTGACCTACGTACCTTCTGCGTTTGCAGCGTCAGACGTTATCGAATGGATTATGGAGCGGTTTAACGTAGACGATTCTAACAACTCGGAAGGATTGATTTTAGCAAATCAGCTCTGTCAATATGGTTACCTCTTTCCTGTCAGCGATTCAAAAGTCCTTGCATTAAAGGATGACAATTCTCTTTTTAGATTTCAAAGCCCATACTACTGGCCGTGGCAAGGTCCCCGGGTGGCCGGAGCTGGGTCGGGCGCGCCAGCCCTGGGGCCCGATAACGTCGAGTACGCAATCTACCTCGTGAAACGCACATTACGCAATAAACAGCGTCACGGCCTGGAGGAGTATGAACAGGAAGCGCTCGCGAACCTCAAGAAGAACCTTGCTGCTAAATGGGACTTCATTACTATGCAAGCGGAAGAACAGGTCCGGTTGGCCAAAGAGCGCAAAAAAGGTGACAAGATAGTAAGCGATAGTCAGGAGAGAGCGTACTGGCGTGTGGCTAGACCGCCGCCGGGGACGCTCACGGCGCTAGAATCCTGTCCGGTGCCAGTTCGCGCTCGACACCCGACCAAACCTAAGAAGAGAACCATACAGCAGATCACGAGAGAGATTGAACACTTAAAGGCTAGCCTGGATCGTACGCGTGTGAAGACCTCAATTGCCCTCGAAGCTCTAATGGCTTATTCGGAGACCTTCGCCGCCTATGATCCCTGGCTCACACAGCCTCAGCCCTCGAATCCTTGGATTACGGATGACACTCTCTTTTGGCAAATCAACAGTCCCATTGTTGAAGTACCGAGCGAGAAACGTGTTCAGCGTTGGGCTGTTTCTATAGAGGAATTAGTATCAGACCCTACTGGTCTTCAGGAGTTTACGAGTTTCCTTAGAAAAGAATATTCTCACGAAAATATTCGATTTTGGTTAGCTGTTATGGACCTTAGAAGAAGTAGTACTAAGCAAATACCTAAGAAGCTTGAAGAAATATATGAGGAGTTTTTAAAGCCAGGCGCGCCCTGCGAAATCAATATCGATGGAGCGACGGCGGATCGTGTTACGGAAGGTATCCGCAGTGGATCGCGATACGCTCTGGATCACGCGGCTGATCATGTTTACGGATTGCTATTGAAAAAGGATTGCTATCCACGATTCATTAGATCGGATCACTTCCAACGGCTGTTGGCTGAGGGTAGAAATGTACATCAAAAGAAAGCTAAATTTTTCAATTTTGGAGGTCAAGTAAAGAAAAAGCCGGGATCTACGAGTGGCAGTAGCGGTAGTGGAGCACTGACTAGGAGACGCGGCTCTGATCGTTCGTTATCGGGTTCTGCGCATGAGCTAGCCGTCTGCGCCGCCCAACCACCTCGAGCTCCCGAACCGCCCCCGCACAGCCACTCACAGTCTAACCTCTGTGATATCCCGTTCAGGGATCCTTTGGACGACGACACGGCAGACGTCCTCCCCTGGGAGAATTCTACTCGGGATAGTGGATACGGGGCGAGGCGAAGGCAGGACTCCACCGCTGACTCGGGCAGTTCGTCGTCGGATGTGAGCGCTGCGTTGGCGACGAGCGAACGACGGCGACTCCCTCAGCAGAG
CACGCTGGATGGTGGACTACGAGGCGCCCCGCCGCCTCTCCGCCGCTTATCCGCTGTAGAGCCGCGCCACCTCGCGCCATTGTCGTCACCGCCACATTCACCGCGTCACACGCGGGCCCACCAGCCGGCGTCACAACCAATACACGCGCCCGTCCCACCTCACCATCCCACCCCCACTATCAGCGTTAGCTCTGCACCGGATGACGAATCCGCCGAATCACCAACCAGAACAGGTTCTCCCGACGACAGGCGCTCGATCGCTCACTCCGAGGAACCTTCTTCAGTTTTTGCGTCGGCGGACGCTACGCCCACAGAACACTCGCTCGCGTTCACAGACAGATGTCCAACGGTGGATACTGCGACGGAAGTTGTTGACACGAGGACTAATGTTTTGAAAGTGGAAGTGAGAACAGCGTCTTTCGAAGCGTCGCTCGCCGCCGAGTCGATTGACTCGTGTGACGAAACCCGCTCGGAATCGACGATAGGTTCAAGTCGGAACTCTAAAATTCAATCGCGGGATAATTCGAAAGACGAGCAAGTGCTGTCTGCTGAAAACGAAATATTTGTGCGTATACCGATACCGCAGGGAGTTCCCGAGACGATTCCAACACTTGTTAAGGCGTCGTCTTTCAGCAAGGAAGAAGAACCTACACGAAATGTCGCCCTGAGTCGGTCCGATAGCGAAGTCACCGGTGTTTCAAAATCAGTTGAGGCGTCTGCAACGGAGTCAATAAGAGACGCTTCCGAGTCGCCGGTGATGGCAGCCGCGCCTGCGGAAGCGCAAGTCGAGCGACGGGTCGCGGTCGCGCCGCTCGCTGTGTGCGAGGATGTCGGCGTGGACGAGGACGCGGTCGACAAAGTGCTTCCGGTGCAGAGGGTCTCCACGTATCCGAAGGCGTCAGATTCCAATGTGGTGAAATTTGCTCATAGTGACGAACTGTCGTCGGTATCCGAATCCGACATTGTTAAATGCGATAAAAGCGCAGTCAGTGAACGTAAGCAGCGGAACGACATTTGTCCCTGGGAGGACGAGAATTGCTGCGAGAGTGACGTTCCATTTGTTAAAACTTACGCAACGCTCGGTTACTTATAA

Protein sequence:

>DPOGS212134-PA
MTSVDEFVNSSAWLIKVLIVIIVIVSVCADVFNITYVHLDRPSKRSRPHYKLALTKAIKVSGAQRGRERHAAPATDSHGGRIARGQPRPLPLLTALAMEPADVPRHRPLAFDKMESLIKEMQDPDTGVPVRSQKLFLTYVPSAFAASDVIEWIMERFNVDDSNNSEGLILANQLCQYGYLFPVSDSKVLALKDDNSLFRFQSPYYWPWQGPRVAGAGSGAPALGPDNVEYAIYLVKRTLRNKQRHGLEEYEQEALANLKKNLAAKWDFITMQAEEQVRLAKERKKGDKIVSDSQERAYWRVARPPPGTLTALESCPVPVRARHPTKPKKRTIQQITREIEHLKASLDRTRVKTSIALEALMAYSETFAAYDPWLTQPQPSNPWITDDTLFWQINSPIVEVPSEKRVQRWAVSIEELVSDPTGLQEFTSFLRKEYSHENIRFWLAVMDLRRSSTKQIPKKLEEIYEEFLKPGAPCEINIDGATADRVTEGIRSGSRYALDHAADHVYGLLLKKDCYPRFIRSDHFQRLLAEGRNVHQKKAKFFNFGGQVKKKPGSTSGSSGSGALTRRRGSDRSLSGSAHELAVCAAQPPRAPEPPPHSHSQSNLCDIPFRDPLDDDTADVLPWENSTRDSGYGARRRQDSTADSGSSSSDVSAALATSERRRLPQQSTLDGGLRGAPPPLRRLSAVEPRHLAPLSSPPHSPRHTRAHQPASQPIHAPVPPHHPTPTISVSSAPDDESAESPTRTGSPDDRRSIAHSEEPSSVFASADATPTEHSLAFTDRCPTVDTATEVVDTRTNVLKVEVRTASFEASLAAESIDSCDETRSESTIGSSRNSKIQSRDNSKDEQVLSAENEIFVRIPIPQGVPETIPTLVKASSFSKEEEPTRNVALSRSDSEVTGVSKSVEASATESIRDASESPVMAAAPAEAQVERRVAVAPLAVCEDVGVDEDAVDKVLPVQRVSTYPKASDSNVVKFAHSDELSSVSESDIVKCDKSAVSERKQRNDICPWEDENCCESDVPFVKTYATLGYL-
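
Length check:

The transcript and protein lengths in this record (3093 bp and 1030 aa) can be checked against each other. The sketch below assumes the transcript is coding sequence only (it begins with ATG and ends with TAA) and that the trailing "-" in the protein sequence marks the stop codon. The function name `cds_matches_protein` is hypothetical and is not part of any database tool.

```python
def cds_matches_protein(transcript_bp: int, protein_aa: int, has_stop: bool = True) -> bool:
    """Return True if a CDS of transcript_bp bases can encode protein_aa residues.

    Assumes the transcript contains no UTRs. If has_stop is True, one extra
    codon is expected for the terminal stop codon.
    """
    if transcript_bp % 3 != 0:
        # A coding sequence must consist of whole codons.
        return False
    codons = transcript_bp // 3
    expected_codons = protein_aa + (1 if has_stop else 0)
    return codons == expected_codons


# Values taken from the record: 3093 bp transcript, 1030 aa protein.
print(cds_matches_protein(3093, 1030))
```

For this record the check passes: 3093 / 3 = 1031 codons, which is 1030 residues plus one stop codon.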