Monarch geneset OGS2.0

DPOGS212406
TranscriptDPOGS212406-TA3087 bp
ProteinDPOGS212406-PA1028 aa
Genomic positionDPSCF300258 - 182641-195308
RNAseq coverage1267x (Rank: top 10%)
Annotation
HeliconiusHMEL0086350.072.18% 
BombyxBGIBMGA002804-TA5e-13766.95% 
DrosophilaExn-PD1e-14343.06% 
EBI UniRef50UniRef50_D6WZ801e-16544.11%Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WZ80_TRICA
NCBI RefSeqXP_974773.23e-16644.11%PREDICTED: similar to guanine nucleotide exchange factor [Tribolium castaneum]
NCBI nr blastpgi|2700132995e-16544.11%hypothetical protein TcasGA2_TC011886 [Tribolium castaneum]
NCBI nr blastxgi|2700132993e-16341.86%hypothetical protein TcasGA2_TC011886 [Tribolium castaneum]
Group
Gene OntologyGO:00056226.7e-55intracellular
GO:00350236.7e-55regulation of Rho protein signal transduction
GO:00050896.7e-55Rho guanyl-nucleotide exchange factor activity
GO:00055157.9e-19protein binding
KEGG pathwaydme:Dmel_CG37993e-140 
 K07525 (NGEF, EPHEXIN)maps-> Axon guidance
InterPro domain[568-790] IPR0002196.7e-55Dbl homology (DH) domain
[913-1011] IPR0014527.9e-19Src homology-3 domain
[836-923] IPR0119934e-06Pleckstrin homology-type
Orthology groupMCL10821 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212406-TA
ATGGTCTCCGAGCGACCATCGAAAGCTGGTCAAGCGAAACTGCTGACCATAAGGAGAAACTGTGGTGACCAAGTGTCCGTGACATCTCCGACCAAAGTGTATATGAAGACAAGTGTCAAGTGTCCGTTCTATAGCCAGAACGACGAGTCCAGGGTGTGTCAGGATGTGGATAAAGTGAAGTTCATTGATGCCCTGGAGCAGTCTTGGTGGATGAGTGTTAGGACAGTTCTCAGGGATGGAGAGTTTGAGATCCAGGATCTCATCGCGTTTATTGACTGTGACAGTAGCGACTCGGGCCATGTGTATGAGTCTCTTGAGCCACCACCGGCCGCCAATCCCACTCACCACACACCCCCGCCCATACCTGATGACGACTTTGACTCCTTTGATAGTGACACCGACACTGATGACCAGGATAAGACTGTACCTCCCGAATTGCCCCCAACTCTGCCCTCCTCGCGTCTGCCCACGCCGCCCAACGGTGGGCCCTACACTCTAACCAAGATCGCGAACGCTGCTCAGAAGAAAATGAAGCAGATTAAGAGAAATCTCACCAAACGATACTCTGTGGCGATGGACGGAAAACCGTTCAACAAATCATCGAGCAACCAAAACGGGACGTACGATGTACCCAAGAATCAAAATAAAGCGAAGTCCCCGATATACGCCAACTACGAGAAGCCCGAGGCCCAACCCGTGTACAGCAACATCAATTTCCACGATACCAAAAACATTCTCCATAAAGTAGAAAATCCCGCAGCAAAAATCAGTGGAACGTTCAAAGAAGAGCTGAAGACTGTAATAGGCGAGAAGTGTTCCACCAGGACTTGGACGTACGAGAAGAGCAAGAACGTGTCAGGAGCTGGTAGGAACGTCTATGAGAAGGGGGCGTTTAAGGCCTCGGAGAAGAGGAACAGCAATGGAGACGCTCCACCGCCGCTACCGGATAAACCGCCGCCCGAGAAACCCACGACCCCCACCAGCGACACCAAGAACAGCGGCACCCTGTCCAGGAAGGCCTACTTCTCCTTCAAGTCGAGGTTCAGGCGAGCCACCTCCATGGCCGTAGACATCAACACGGACGTTCCCAGCGCACTCAAGATCACCAATTCCACGTTCTACCTGACCGACTCCATCGACGGCGACTCCGGCTTCAGCAACTGCAGCGAGAGCGACCGCCGCGAGGGTATGCGTGCCGAGTCCTCCCCCTGCCTGGCGCTGGAGAGACCAGCCCTCCCCCCACCGCCGCCGCCCCCCCACCCCGCACACAACACGCACACTGACCTTAGCGCTGTCAACGAAGAGCTGAGGCGGTTGCTGCCGACCCTGTCTCGCAAGGAGCGTGTGGCCCGCACCAGGACCTCGTGGTACGCGGACTGCGCCCCCGGGAACGCGTCCTCACTGTACTCGGAGGCTGGGCTGTATCAGGCTGGTAATATTTCAAGTTCTTCCGGCGCCTCCTCCGGCTCGCATCCCGCCTCGCCGCTCCCCCACTCGCTGTTCACACACGAGCCGCTCTACCAGTTCTACAACGCTGCCAAAATTGAGTCCGCGTGTCGCGAGACGGGCGACTCAGACTCGGACGCCTACGAGGTTGGTGGGAGTGTGGGGGCTGGTGGTCCAGGAGGTGTAGGAGGCCAAGGGGCCCGTCCGTCCGCGATGGCACTAGTAGCCCCGCGAGGACCCGCCAGGACGCTCTGGTGCGAGGTCCCGGAGGTCCTCAACTCAGCTGTCCTCAGTTCCCTTGCACCAGCACAAAAACGTCTCCAAGAGGCGAAGTTTGAGCTGCTGACATCAGAAGCCTCGTACCTCAACTCGCTGAACGTGTTGGAGACACAGTTCATGTCTCACCCGGCCTTCAGAGATCCCCTGGTGCTTCCTCCACACGAGTTCGACACCCTGTTCGCTGCTATACTCCCAGTACGCAAATGCTCCCAGCTACTGATGGCTGACCTCGAGCGCTGCTGGCAGGAGAACATCCTGCTGCAAGGCATCTGCGACATCGTCCAGCGACACGCCAGCGCCAGATTCAAAGCCTACGTCAAGTACTGCGAGAACCAACCGCTCATGGTCAAAGCGCTGCAGCGGATGAAAGATAGACCGGCGTTCGCCAGCGCCCTCAAGAGACAAGAGAGTCACCCGCTGTGCCAGTCGCTGTCTCTGCACTCGTTCCTTCTGCTGCCCATGCAACGCGTGACACGCCTGCCGCTGCTGCTGGACGCCGTGTTGAGGCATCTGCACGCCGACGACGACGAGTACGAGGGTTGCATGCACGCACTCGCCACACTCAACGACTTCGTGTCTCAATGCAACGAGGGCGCGAGGAACACGGAGCGTGTGGAGGACATGTGGCGGCTGTCCAGGAGCGTGGTGATACCGCCCGCGATCCGCGGGGTGCCGGAGCTGGGGCCCGCGCTGGCGAGGAGAGACCGCAGGCCCGTGAGGTGGCTGGTGAGGTCGGGGGAGATGACGCAGCTGGTGTGGAAAACGGACGAGCTCAAGTTGACCTTCGGCAAGAAGTTCCACAAGCTGCCGCTGCACCTGTTCCTGTTCAACGACCACCTCATCATCACCAAGAAGAAAGGCGAGGATTGCTACTCGGTGGTGGAGCACTGTCCTCGCTCGCTGGTGGAGGTGTGTTCCAGCGAGGCGGCCGGCGTCAAGCACGCGCTGCTCCTCACACTGCTGGAGAACCACGACGGACGGACCGTCGAGATGTTGATGTCGTGTGCGAGCGAGACGGACGTACACCGGTGGACCGAGGCCCTGGCGCCGCCGGCCGCGGACGCCGGCGAGACCCTGTACGCCGGCTGGGACTGTCCTCAGGTGGCCGCCGTGTACCCTTACGCTCCACACCAACCCGACGAACTGGCGCTGGCTGAGGGAGATATAATCAACGTTACCAGGAAGACCAACGAGGGTTGGTACTACGGCGAGCGGACCCGTGACGGCGAGGCGGGCTGGTTCCCCGGCGCCTACACCGCGGAGATCGCCTCGCCTCACGTCCGGGCCAGGAACCTGCGCCAGAGGTACCGCCTGCTGGCGCTGTCCGCCACCTACCTCGGGCAGAGGAAGAAACCTCTATAA

Protein sequence:

>DPOGS212406-PA
MVSERPSKAGQAKLLTIRRNCGDQVSVTSPTKVYMKTSVKCPFYSQNDESRVCQDVDKVKFIDALEQSWWMSVRTVLRDGEFEIQDLIAFIDCDSSDSGHVYESLEPPPAANPTHHTPPPIPDDDFDSFDSDTDTDDQDKTVPPELPPTLPSSRLPTPPNGGPYTLTKIANAAQKKMKQIKRNLTKRYSVAMDGKPFNKSSSNQNGTYDVPKNQNKAKSPIYANYEKPEAQPVYSNINFHDTKNILHKVENPAAKISGTFKEELKTVIGEKCSTRTWTYEKSKNVSGAGRNVYEKGAFKASEKRNSNGDAPPPLPDKPPPEKPTTPTSDTKNSGTLSRKAYFSFKSRFRRATSMAVDINTDVPSALKITNSTFYLTDSIDGDSGFSNCSESDRREGMRAESSPCLALERPALPPPPPPPHPAHNTHTDLSAVNEELRRLLPTLSRKERVARTRTSWYADCAPGNASSLYSEAGLYQAGNISSSSGASSGSHPASPLPHSLFTHEPLYQFYNAAKIESACRETGDSDSDAYEVGGSVGAGGPGGVGGQGARPSAMALVAPRGPARTLWCEVPEVLNSAVLSSLAPAQKRLQEAKFELLTSEASYLNSLNVLETQFMSHPAFRDPLVLPPHEFDTLFAAILPVRKCSQLLMADLERCWQENILLQGICDIVQRHASARFKAYVKYCENQPLMVKALQRMKDRPAFASALKRQESHPLCQSLSLHSFLLLPMQRVTRLPLLLDAVLRHLHADDDEYEGCMHALATLNDFVSQCNEGARNTERVEDMWRLSRSVVIPPAIRGVPELGPALARRDRRPVRWLVRSGEMTQLVWKTDELKLTFGKKFHKLPLHLFLFNDHLIITKKKGEDCYSVVEHCPRSLVEVCSSEAAGVKHALLLTLLENHDGRTVEMLMSCASETDVHRWTEALAPPAADAGETLYAGWDCPQVAAVYPYAPHQPDELALAEGDIINVTRKTNEGWYYGERTRDGEAGWFPGAYTAEIASPHVRARNLRQRYRLLALSATYLGQRKKPL-