Monarch geneset OGS2.0

DPOGS203032
TranscriptDPOGS203032-TA1908 bp
ProteinDPOGS203032-PA635 aa
Genomic positionDPSCF300068 + 633178-635393
RNAseq coverage22x (Rank: top 78%)
Annotation
HeliconiusHMEL0095370.083.12% 
BombyxBGIBMGA012269-TA0.078.16% 
DrosophilaCG9098-PA1e-6741.52% 
EBI UniRef50UniRef50_D7ELK53e-12038.84%Putative uncharacterized protein n=3 Tax=Tribolium castaneum RepID=D7ELK5_TRICA
NCBI RefSeqXP_310126.43e-12439.89%AGAP009560-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|1582882446e-12339.89%AGAP009560-PA [Anopheles gambiae str. PEST]
NCBI nr blastxgi|2700157843e-12840.33%hypothetical protein TcasGA2_TC004107 [Tribolium castaneum]
Group
Gene OntologyGO:00055151.3e-20protein binding
GO:00072648.9e-10small GTPase mediated signal transduction
GO:00056228.9e-10intracellular
GO:00050858.9e-10guanyl-nucleotide exchange factor activity
KEGG pathway 
InterPro domain[14-124] IPR0009801.3e-20SH2 motif
[314-488] IPR0235781.6e-13Ras guanine nucleotide exchange factor, domain
[399-495] IPR0018958.9e-10Guanine-nucleotide dissociation stimulator CDC25
Orthology groupMCL11703 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203032-TA
ATGACGTCGTCAGGGTTACTGGAGTGGGAGTTATCTTTACCATCCGAGCAGATCGTCTCTCACGGATGGTACCATGGGGCGTTGAGCAGAGCGGCTGCTGAGGCACTCCTGCAGCAGGACCGGGAGTTCCTCGTGAGGGACTCGTCATCACAACCAGATAATTACGTACTCAGCTGTCGCTCGAACGGTCAACATCTCCACTTTGTCATACAGAGGATCGTCGTTCATCCCGACACAGTGTTTGAGAGATATCAATATCAGTTCGAAGATGAAGCGTACGATACAGTCGCTGACCTTATAACGTCTTACGTCGGTTCCGGTAAACCAATATCGGCAGCATCCGGGGCTCGCATACAATATCCGGCGAACCGTTTGGTTCCTTTAACGAATTACGTATCGGGCGATGATGGCGGGACTGTGACATCGCCCAACGCTGACGGCAACTATGGAAATGCGTACAGTCTGTATAGTCACTTCGCGGCTAAGCCAGGAGCACAACTGCGCGTACCTTTCAAGAAACAGAGGTCTCACTCGTTGACTCCCATAGACATGGCGCAACACGTGACTCACGGTAAGAGTGCCAGTGCCGATGGTGTAATACAAAGTCAAACAAATAAACACGGCAAGATATTTCCCAATGAGTCGTCTTCAGGTTGTAACACGTGGGACGCGAACTTACCGACCAGTCCGCCGGCGAAGCCCGCGCGCTGCGAGCCTGCCAAGACCGACGAGCGCCGCCTGGAACTGCATCGACTCTATCACGTATCCGGTTCCGATTCCGGCAATGGCTCCGGGGACTCCACGCAGAGCTCTGCGCTCGGGGACTCCAAGAGCAGACATGAAACTGACTACAATACTACCAGCACGCCCTCTGTCAAACAACAGTTTGATTACGAACTTACCGAAGCGAGACTTTTGCAAATGGAAAACTTGAATTTTATCGTTGACTCTAGAATTAACCTCGAAAACTTTCAGAGTCAGATCATTCCACCGTCTCTCACGAAACCATTGGAATCTGAAACGCTGCATACCATTAAACTATTATTACGAACTTCCGGCCCTCGCATTTTGGCCAACCACCTAACTTATGTCGATCTACATTTTCTTCTGGATGAATCGTTACAAATTGAAGGTCTAGATAAAATGGCGTCAGGCTTAGAGTTGTGTTTTTTACCTCAGGGACGATCTCTAAGACTGGATATCATAGAGAGAGTTGAAACATTTCGCCTCCTGGTAGCCGTCACTATACTGACCTGTCAAACAGACAGACAAAGAGCAGATACTATCAACGATTGGATTCTTTTAGCCATTGAAACAAAAACGGCCCTAGGAAATCTATACGGATTCTCAGCAATAATGTTCGGCTTATGTATGCCACAGGTAGAGTGTTTAGAGAATGCTTGGAATATACTGCGGCGTGTGTACACGGACAACGCTTTCATGTTCGAAGCAAAACTGCGGCCATCCTTCAAGAGCATGAACGAGGGAACCAACCCGCTACCACCCAACACGACCACTCCTCACGTCATACCACCGGTGCTGTGCTTCCATCTGTTCGACGGTGAGGGCGGCGGCCTGCTGCAGCCGGACGACACCTTCCCCAACAGTCTCATGAACAGTCTAGAGTTCGACTTCAACGCGACGGCGGCACACCTGGAGGCGGCTCGCAACCTGCCTCAGCAGGCGGAGGCTCTCCGCAGGCGAGCGCGGCCCGTGCTGCAGCCGTCCACCGCGCCCTCGCCGACGCTGCTGGAACTCTTCCGTACTGAGATGGCAGCAGTGTTTCTATGGGGCGAGCGAGGCGCGCGCTCTCCCTCTAACCAGCGGTTCGCGCGGCTTACGGACGTGCTCACTGCCATGGCCGCTAAACTGGCCGCGCGACCCCCGGACGCAGACGACGCCGTTTGA

Protein sequence:

>DPOGS203032-PA
MTSSGLLEWELSLPSEQIVSHGWYHGALSRAAAEALLQQDREFLVRDSSSQPDNYVLSCRSNGQHLHFVIQRIVVHPDTVFERYQYQFEDEAYDTVADLITSYVGSGKPISAASGARIQYPANRLVPLTNYVSGDDGGTVTSPNADGNYGNAYSLYSHFAAKPGAQLRVPFKKQRSHSLTPIDMAQHVTHGKSASADGVIQSQTNKHGKIFPNESSSGCNTWDANLPTSPPAKPARCEPAKTDERRLELHRLYHVSGSDSGNGSGDSTQSSALGDSKSRHETDYNTTSTPSVKQQFDYELTEARLLQMENLNFIVDSRINLENFQSQIIPPSLTKPLESETLHTIKLLLRTSGPRILANHLTYVDLHFLLDESLQIEGLDKMASGLELCFLPQGRSLRLDIIERVETFRLLVAVTILTCQTDRQRADTINDWILLAIETKTALGNLYGFSAIMFGLCMPQVECLENAWNILRRVYTDNAFMFEAKLRPSFKSMNEGTNPLPPNTTTPHVIPPVLCFHLFDGEGGGLLQPDDTFPNSLMNSLEFDFNATAAHLEAARNLPQQAEALRRRARPVLQPSTAPSPTLLELFRTEMAAVFLWGERGARSPSNQRFARLTDVLTAMAAKLAARPPDADDAV-