Monarch geneset OGS2.0

DPOGS203300
TranscriptDPOGS203300-TA3060 bp
ProteinDPOGS203300-PA1019 aa
Genomic positionDPSCF300003 - 1152054-1171535
RNAseq coverage316x (Rank: top 36%)
Annotation
HeliconiusHMEL0166540.084.76% 
BombyxBGIBMGA012252-TA0.085.67% 
Drosophilatrio-PC0.055.76% 
EBI UniRef50UniRef50_G6DIR30.0100.00%Putative uncharacterized protein n=3 Tax=Pancrustacea RepID=G6DIR3_DANPL
NCBI RefSeqXP_002424753.10.055.91%Huntingtin-associated protein-interacting protein, putative [Pediculus humanus corporis]
NCBI nr blastpgi|2700037870.060.91%hypothetical protein TcasGA2_TC003063 [Tribolium castaneum]
NCBI nr blastxgi|2700037870.060.85%hypothetical protein TcasGA2_TC003063 [Tribolium castaneum]
Group
Gene OntologyGO:00056226.1e-60intracellular
GO:00350236.1e-60regulation of Rho protein signal transduction
GO:00050896.1e-60Rho guanyl-nucleotide exchange factor activity
GO:00055152.4e-35protein binding
KEGG pathway 
InterPro domain[540-723] IPR0002196.1e-60Dbl homology (DH) domain
[738-869] IPR0119932.4e-35Pleckstrin homology-type
[187-288] IPR0181594e-13Spectrin/alpha-actinin
[186-286] IPR0020175.1e-13Spectrin repeat
[911-983] IPR0014522.3e-09Src homology-3 domain
Orthology groupMCL10779 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203300-TA
ATGCTCAAGTATCATCGGCGGTCGCATAGACACAAATCACCGGTCCAGTGCCTGAGATGGAGGTCGTCTGGAGAAACGGAGGCAGCGAGCACAGCGGCGGTGGAGTCGACGCTGGAGAGACTTAGAGGAACCCGAAGTGCCCTTGAAGAACTCTGGTCCGACCGTGAGAGAAGACTGGAACTCACACTGCAACTGCGACATTTCGAGAGAGATGCCCTGGAGGTGTCGTCACGTTTGGAACTATGGGGTGAAGAGTTGCAGCGCACGGAACCGCCTCGGGACCCTCAACAGGCGGAACAAGCTCTTGCAGCACATAACGAGAGCGTGGCCAGGATGCAACACGCCACCTTCCAGGTGGTGCAACAAGGACAGGAACTGGCGGCCGCAATAGCTGAGGCATCAGATGTGATGGCTTCTAGTTCCGACAGCTCGGAGGGCGCTCCTATGGATGCTCAAGCTCGCGTTCAGCTTCTTCTAGAGTTCTTACATGACAGACAATTGGATTTGGAAGAACTGGCTGAAGAGCGCCGTGCACGTTTCGAGCAATGCGTTCAACTTGGTCAGTTCCAAAAAGACGCAGCCCAAGTCGTGAGTTGGATAAGAAATGGTGAGGCGATGTTGTCAGCGTCATTCTCAATACCCGGAACACTGTCTGAAGCGGAGCAGCTGAAACGAGAACATGATCAATTCCAAGTGGCCGTCGAGAAGACCCACGCTAGTGCTGTGCAAGTTAAATACCGGGCGGACGCTCTTCGCGCTGCTAACCACTACGACCCACACACCATAAGAGAGATATCTGAGGAGGTGACAGAGAGATGGCAGCGTCTGGTGACTTGTGCGGAGGAACGTCACAAATTGGTGACGGCGTCGTTGAACTTCTACAAGACAGCAGAGCAGGTGTGCTCTGTGCTGGACTCGTTGGAGCGCGAGTATCGTCGTGATGAGGATTGGTGTGGTTCAACGAGCGCACCGGCCGCAGCCGCCTCCAACCTGGACAAGGCTTCACAGGTTGCGGCGTTGATAGTGAAGCACGGCGAACAGAAGGAGGCGTTCCTGAAAGCGTGTACTCTAGCTCGTCGCACGGCTGAGACGTTCCTAAAGTACGCGGCTCGTTCAGCTCAGGTTCACGGACAGACTGCGGCCGCTAGTAAGGCTCACCACGAACAGACACGTGCTATACTCGACACTCTACTGGCGCAAGAGAACAAGGTCCTCGAGCATTGGACGGTCCGTAAGAAGCGCCTGGAGCAGTGCCAGCAGTTCGCGTTGTTCGAGCGCTCAGCCCGCGCGGCCGTCGAGTGGATCCGCGAGACTCAGGAGCGGTGCGGTGCTGCTGCCGCTGCTGCTGCCGGCGGCGTCGCTCGCGAGTCACGGGAACGAGTCAGACTGTTGGCGCAGCTCGCCGACGGGCTCGTAGAGAAGGGTCACCCGCACGCCGTGCAGATCAAAGAGTGGGTCGCCGCCGTTGATGCCAGATATGCGGAGTTCAGTGGGTCCATGGAAGGCGGGGAGAGCGAGGCTGAGAGTGATTCTGGTGTAGCGGCGTCGTTGAGCTCCGGACAGACCAGTGAGACTGAGACTCGCGTGGAACAACCGCCGGCCGCCTCCGCTGACGATAAACGTCGGAGTGCACGTCGCAAAGAGTTTATAATGGCGGAGCTCCTGCAGACAGAGCGCGCATACGTCAAGGATCTGGAGACTTGCATAACATGTTATCTGCGGGAGATGAGAACAGACCCAGCCTCCGTACCGACCGCACTTCAGGGCAAGGAGGAGCTGATATTCGGTAACATAGAGGAGATACATCGGTTCCACGAGCGTGTGTTCTTACGCGAGTTGGATAAGTACGAGACTATGCCAGAGGATGTTGGTCATTGTTTCGTGACCTGGGCGCGAGAGTTCGATATGTACGTCTCGTACTGTAGGAACAAACCCGACAGCAATGCTGCGGTCGTCCAACACGCCGGCGACTACTTCGATAGAGTGCAGCGCAGGAAGAAACTAGAGCATCCGTTGGCCGCTTACCTCATAAAGCCGGTGCAAAGAATCACTAAATACCAGCTGCTGCTGAAAGACCTCCAGGCGTGCTGTGCCGAGGGTCAAGGAGAAATTAAGGACGGGCTGGAGGTGATGTTGTCTGTACCGAAGAAGGCCAACGACGCCATGCACCTGTCGAACCTCGAGGGCTGCGACGTGCCAACGGACAGCCTGGGCGAGGTGGTGCTCCAGGACTCGTTCCAGGTGTGGGACCTGCGTCAGATCATCAAGAAGTGCCGCGAGAGACGCGTCTTCCTCTTCGACCTGCACCTCCTGCTAGCCAAGGAAGTGAAAGACACACACGGAAAGGCTAAATACATATACAAGACTAAATTCATGACATCCGAGCTGGGTGTGACGGAGCACATCGAGGGCGATGATTGTAAATTCTCAGTGTGGACCGGTCGTGAGCCTATGGCCAGCGACTGCCGCATAGTTCTCAAGGCGCCCTCCCTCGACGTCAAGCAGACGTGGGTCAGGCGCTTACGAGAAGTCATACAGGAAACCTACTTCAGTGCGGCTCTGCAGCAGCCACCGCGCAGCCCGGCCCGGGCTCCGCCACCCAGCTCGCAGAGATCGAGCCGTGACTTCGAAGACACGGACACAGAGAATCTGGACCGCAACTCACTGGCTTCATTCGGCAGCGGCAACACTACAGACTCCGATAAGGTCATGTGTAACACTCAGTACAGTGGTAACAGTCCCGCTGGAGCTGAGATGAGCTGGGTGGTCGCCGACCACTCGTCGGGAGGAGCTGGGGAGGTGTCGGTATGTAAAGGACAGCAGGTGGAGGTGCTGGAGGCGTGGGCGGCGCGCCCCGATTGGTGGCTGGTGCGCCGGGCGGGCGAGCCTCCAGTTGAAGGAGCTGTACCCGCCGCGGTGCTGAAGCCTCAGCCGCACCAGAAGACGTCACCGTCAAGGCGACCACTCAGCCAGCCTGATGATAACATAGGTCATGAAAATGCTCGTACTGGTCGAGGCGTGGCCAAGTCACAGATTGGTCACATATTTGGACTAGGTTAG

Protein sequence:

>DPOGS203300-PA
MLKYHRRSHRHKSPVQCLRWRSSGETEAASTAAVESTLERLRGTRSALEELWSDRERRLELTLQLRHFERDALEVSSRLELWGEELQRTEPPRDPQQAEQALAAHNESVARMQHATFQVVQQGQELAAAIAEASDVMASSSDSSEGAPMDAQARVQLLLEFLHDRQLDLEELAEERRARFEQCVQLGQFQKDAAQVVSWIRNGEAMLSASFSIPGTLSEAEQLKREHDQFQVAVEKTHASAVQVKYRADALRAANHYDPHTIREISEEVTERWQRLVTCAEERHKLVTASLNFYKTAEQVCSVLDSLEREYRRDEDWCGSTSAPAAAASNLDKASQVAALIVKHGEQKEAFLKACTLARRTAETFLKYAARSAQVHGQTAAASKAHHEQTRAILDTLLAQENKVLEHWTVRKKRLEQCQQFALFERSARAAVEWIRETQERCGAAAAAAAGGVARESRERVRLLAQLADGLVEKGHPHAVQIKEWVAAVDARYAEFSGSMEGGESEAESDSGVAASLSSGQTSETETRVEQPPAASADDKRRSARRKEFIMAELLQTERAYVKDLETCITCYLREMRTDPASVPTALQGKEELIFGNIEEIHRFHERVFLRELDKYETMPEDVGHCFVTWAREFDMYVSYCRNKPDSNAAVVQHAGDYFDRVQRRKKLEHPLAAYLIKPVQRITKYQLLLKDLQACCAEGQGEIKDGLEVMLSVPKKANDAMHLSNLEGCDVPTDSLGEVVLQDSFQVWDLRQIIKKCRERRVFLFDLHLLLAKEVKDTHGKAKYIYKTKFMTSELGVTEHIEGDDCKFSVWTGREPMASDCRIVLKAPSLDVKQTWVRRLREVIQETYFSAALQQPPRSPARAPPPSSQRSSRDFEDTDTENLDRNSLASFGSGNTTDSDKVMCNTQYSGNSPAGAEMSWVVADHSSGGAGEVSVCKGQQVEVLEAWAARPDWWLVRRAGEPPVEGAVPAAVLKPQPHQKTSPSRRPLSQPDDNIGHENARTGRGVAKSQIGHIFGLG-