Monarch geneset OGS2.0

DPOGS208826
TranscriptDPOGS208826-TA2706 bp
ProteinDPOGS208826-PA901 aa
Genomic positionDPSCF300036 + 611841-618089
RNAseq coverage55x (Rank: top 69%)
Annotation
HeliconiusHMEL0038662e-0951.56% 
BombyxBGIBMGA007932-TA0.086.24% 
DrosophilaPsGEF-PA1e-15841.67% 
EBI UniRef50UniRef50_Q5TPR83e-16945.03%AGAP004701-PA n=1 Tax=Anopheles gambiae RepID=Q5TPR8_ANOGA
NCBI RefSeqXP_554654.35e-17045.03%AGAP004701-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|1582980471e-16845.03%AGAP004701-PA [Anopheles gambiae str. PEST]
NCBI nr blastxgi|1571076104e-15844.85%hypothetical protein AaeL_AAEL004819 [Aedes aegypti]
Group
Gene OntologyGO:00055157.5e-20protein binding
GO:00056224.3e-16intracellular
GO:00350234.3e-16regulation of Rho protein signal transduction
GO:00050894.3e-16Rho guanyl-nucleotide exchange factor activity
KEGG pathway 
InterPro domain[174-298] IPR0014787.5e-20PDZ/DHR/GLGF
[354-539] IPR0002194.3e-16Dbl homology (DH) domain
Orthology groupMCL15634 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208826-TA
ATGTCATTTGTGTCACAAACGGTTGCTGGACTGAGTCAGACTCGACGCGATGACGTCTCTCCGAGACAACATTGCTCTCACCAGCACGCTCATCGCAACCTGCAACTGAGCAGAAAAAATAGTCTACCAGAATGGGATCCACCAGAGCCTAGAGCAAGAAAAAACAGCCTCCCCGCTGACATCCATGATAAATCCGAAGTATCCCACACTGGCAGTTCGCCCATACTTGAGGAAGAGCGGAGCGAGTTTCTAGGATGTGTCTCGTTCCCTCTAAAAGATGCTATTAAGAGCGAACTCTCCGGCACTTACTTCCTGCAAGGGCGGCAGGCGCCGGCTCCGGACCGCGTGACCGAGCGACGCAAGATGGCAACCGATAATATTGAGACTAGTACCAACGAGGGTGCAAAGGAAAACACGGATAAAGGCGTATCATCCAACGGCCCATCCAATCAAGCCCAGGTGGCAGCAGCACTGCAGCGTGAGGCTGATGAGCATCTATTCTTACGGTATTTGGAGCTTGACCCACCCGAAGATCCCGCGGCCCCGAGACGCCCTCAAAGAAATGGACGGACGCCTTTCACAACTACTAGGAAGCTGATGCGTGGTTCAAACGGCGGTTTCGGATTCACAGTAGTTTGGACAAGACCACCAAGAGTTGAAAGGGTTGCGGCTGGCGGGTCGGCGGAAAGAGCAGGTCTTAGGGCCGGCGACTACATAGTATTTGTTGGTAACAAGAACGTAGTCACCGCTACCGAAGAGGAAGTTCGAAATCTTGTAAAGTCAGCTGGTCAAAATCTAAATTTAGAAACATTCCGTCGGGTACCTCAGAACGGTGCTGGGGTCAAATCACGACTGGCTCAAGTCACGATACCAGTGCCGGAATCTACTCCTCCACCAGCCCCCCGGTCTCCTCGAGCAGTAGCCCTGGCGTCCCCAGCGGCAGCTCGACCACCAACAGCATGCTCCTCCACCAGCCAATCCTTGGATAGACGCAAGCTCCACCTGCCGCAGGTCACCTTCTCCAAGGAGACCAACACCCTCCAGGCTCCAGGTATGACAGTAGATGGACGGAAGCGAGCTCTTTTGGCGGTAGTCTCAAGAGAACAACATTATGCAACATCTCTCCAGTTTGGATTGGTTCGGTTCGTTTCACCTCTTGCTGAGAGAGCCGACCTCATAGCTCCCAGCGATCATCAATTGCTGTTTCAAAATATTGAGGAGATATTTAGGCTGTCAGAAGATATCTTGGATCAAATCGTCCAAGACGATGGTGAAATACAAACATCTACGGTCGTTGCTGTTTATTTACAAAAGACTTCAACGTTTCTTAGCCTTTACAAAAAATATTGTCTTGGTCTAAAAAGAGCTGATTGTGTATTGGTCAAGAAATCGAAAGACCCAAGTGGTGCATTTTCTCGTTTTTGTACAAGTCCGCCAATACCACGAAAACGACCAGACATCACCTCATTAGTTCATAAACCACTGGAACAATTCAGGGAACTTCTTAGGTTAATGAGAACAGCTGCTAGTAGTAGTGGAGCCGGACCATACGATCAACTAGAGAAAAAACAACTTCAGCTTATCGTAGAACAGCTGCAGTCTGGCTACCAAGATGTCACTGCAGGTTCAGGTTTAATGGGTCTTGCTGGTGACGGAAAACCCTTACTATCTGTTGCTGATCTCGAAAGTAGACTCGTGTTCACTAGATGTAAGCCATTCGTTCTAAGTACGGCAGGAAGACAATGGATATTTGGGGGAGAACTATCTCGTGTGGAAGGTAGAACAATTCGGCCATACTGGGCATTGCTCTTTACAGACTTACTTCTTTTTGCCACAGTGTCACGTGATCGAGTTTTGTTTGTAACGGAAGAGCCAGTTGCTTTAGGAACTGTGTCAGAAGCTCAGTTTAATGTGAGAAAAAAAGCCACCGAGTTTCGTCTAATCTTAGGACGAGCTGGTGGTGAAAGCCCACTAGTATCGTGCGCTCCAAGAACTCCTGCTCGATCAAGACCTATCGTTTTGCGCGCTCCGTCCATCGATCTAAAGGCTGTTTGGCAGAGTTTGCTGCAAAGGCAAATTTACCGAGCACATACGACATCCGGAACGCCCCTTGGGTCCCCACTTGACTCGCCCGATCCTCAGTTGACCTTCAGTCTTGCGACCCTCGATTCCCAACAACGACAGGTATCAGGATCCAGCATACAACGACACTCAAGTCTCGCTTTATTACCGGCAAGTTCTTACCGCTCACACCTTAATATGCTTGTCGAAGAAAGACCAGAAAAAGAAGTCAGAGTTAGTTTTGACGTGAGACCCAGTTCGGCTTCTCCCCAACCCCAACCAGTAAGAGGTCCTTTGAGTACCGTTTGGAGAACAGCTCCTACACCTGACACTCCTTCCGCCAATCTATCCCCAGTAGACTCTCAAACTTTCTCTTCTCATTTTGTTTCATCTTCAAGTATAGAAAGAAATTATGAGAATGCATCTCCCAGCGCCCAGCTGACACCTGAAGCTGGAGAAGACTATTCTGAAATTTCACCTTGTCGTTGGGAATCTTTTGAAGCAGATTTAGCATTGGCTGATTTAGACGTTGATCCATCTTACAAACATGCTGTAGAGGAGCATATTTTCTTACCTGATGATACACTTCTACCACGAGTTCCCTTACCCGAGAGGCCAACACCACCCGCTTACCTTGAACTTTGA

Protein sequence:

>DPOGS208826-PA
MSFVSQTVAGLSQTRRDDVSPRQHCSHQHAHRNLQLSRKNSLPEWDPPEPRARKNSLPADIHDKSEVSHTGSSPILEEERSEFLGCVSFPLKDAIKSELSGTYFLQGRQAPAPDRVTERRKMATDNIETSTNEGAKENTDKGVSSNGPSNQAQVAAALQREADEHLFLRYLELDPPEDPAAPRRPQRNGRTPFTTTRKLMRGSNGGFGFTVVWTRPPRVERVAAGGSAERAGLRAGDYIVFVGNKNVVTATEEEVRNLVKSAGQNLNLETFRRVPQNGAGVKSRLAQVTIPVPESTPPPAPRSPRAVALASPAAARPPTACSSTSQSLDRRKLHLPQVTFSKETNTLQAPGMTVDGRKRALLAVVSREQHYATSLQFGLVRFVSPLAERADLIAPSDHQLLFQNIEEIFRLSEDILDQIVQDDGEIQTSTVVAVYLQKTSTFLSLYKKYCLGLKRADCVLVKKSKDPSGAFSRFCTSPPIPRKRPDITSLVHKPLEQFRELLRLMRTAASSSGAGPYDQLEKKQLQLIVEQLQSGYQDVTAGSGLMGLAGDGKPLLSVADLESRLVFTRCKPFVLSTAGRQWIFGGELSRVEGRTIRPYWALLFTDLLLFATVSRDRVLFVTEEPVALGTVSEAQFNVRKKATEFRLILGRAGGESPLVSCAPRTPARSRPIVLRAPSIDLKAVWQSLLQRQIYRAHTTSGTPLGSPLDSPDPQLTFSLATLDSQQRQVSGSSIQRHSSLALLPASSYRSHLNMLVEERPEKEVRVSFDVRPSSASPQPQPVRGPLSTVWRTAPTPDTPSANLSPVDSQTFSSHFVSSSSIERNYENASPSAQLTPEAGEDYSEISPCRWESFEADLALADLDVDPSYKHAVEEHIFLPDDTLLPRVPLPERPTPPAYLEL-