Monarch geneset OGS2.0

DPOGS213344
TranscriptDPOGS213344-TA2028 bp
ProteinDPOGS213344-PA675 aa
Genomic positionDPSCF300109 - 374219-378898
RNAseq coverage499x (Rank: top 25%)
Annotation
HeliconiusHMEL0163593e-11054.52% 
BombyxBGIBMGA009151-TA0.060.97% 
DrosophilaRhp-PA0.050.14% 
EBI UniRef50UniRef50_E2AKP70.055.59%Rhophilin-2 n=16 Tax=Coelomata RepID=E2AKP7_CAMFO
NCBI RefSeqXP_971529.10.057.93%PREDICTED: similar to rhophilin [Tribolium castaneum]
NCBI nr blastpgi|910856650.057.93%PREDICTED: similar to rhophilin [Tribolium castaneum]
NCBI nr blastxgi|2700100280.057.47%hypothetical protein TcasGA2_TC009368 [Tribolium castaneum]
Group
Gene OntologyGO:00055151.1e-18protein binding
GO:00071655.4e-18signal transduction
GO:00056225.4e-18intracellular
KEGG pathway 
InterPro domain[93-485] IPR0043282.2e-100BRO1 domain
[514-610] IPR0014781.1e-18PDZ/DHR/GLGF
[6-80] IPR0110725.4e-18HR1 rho-binding repeat
[21-86] IPR0008611.2e-15HR1 repeat, rho-binding
Orthology groupMCL12518 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213344-TA
ATGTTCTCTGTCTCAGGCTCCGACCCCCGCGTGGCGACCTGCCGCGGCCGTCTCCAGACCCGCAGGTGTCAACTCAACCAGGAGATCAACAAGGAACTGCGGCTGAGGGCGGGCGCGGAGAACCTGTTCAAGGCTACCACCAACAGGAAGCTGAGGGAGACGGTGGCCTTGGAGCTGAGCTTCGTCAACTCCAACCTCCAGCTGCTGAAGGAACAGCTGGCCGAGCTCAACTCCTCGGTGGAGCTGTACCAGAACGACAGCAAAGAATGTGTGATGCCCATGATCCCGCTGGGCCTGAAGGAGACCAAGGAGGTGGACTTCCGCGAGCCCTTCAAGGACTTCATCCTGGAGCACTACAGCGAGGACGCCGCGGCCTACGACGACGCCATCTCCGACTTCATGGACATGAGACAGGCCACGCGCACGCCGGTGAGGTCTTCAGCTGGCGTGGCGCTGCTGTTCAAGTACTACAACCAGCTGTACTACATCGAGCGCCGCTTCTTCCCGCCGGACAGGTCGCTCGGGGTCTACTTCGAGTGGTTCGACTCCTTGACCGGCGTTCCGTCGTGCCAGAGGACGGTCGCCTTCGAGAAAGCCTGCGTGTTGTTCAACATAGCCGGCATCTACACGCAGATAGGAGCCAAACAGGAGCGCTTCACGTGCTCGGGCCTGGACGGGTCCGTGGACGCGCTGCTGCGGGCGGCCGGCGCGCTCCGCTACATACACGAGAACTTCACCAACGCTCCCTCCGTGGACCTCGCCGCCGACACGCTGCTGGTGCTGGGCGCGCTCATGACGGCGCAGGCCCGCGAGTGTCTGTTCGAGAAGCTGCAGCTGCAGGCCAGCGAGTGTCGCGACCAGCTCGGAGACCAGCTTAGCTTGGACCTGGCGCAGGAGGCCGCACGCCTGGCAGACACCTACCGACAGCTGTACGAGAAGATGCAGACGGAGGGAGTCATCAACTACGTGCCGTACTCCTGGGTGTCGCTGGTGCACGTCAAGGCGGAGTTCTATAGAGCGGTGTCTCACGAGTACTGCGCCGCGGGGCTCCTGGCACCCGAGGGGCCGCCCGGCGACAGACTGCGGGACCTCTACCAGCCGGACGGACACGGCGAGCAGGCCTCGTGCGAGAGCTCTCTGCCGGCGCTGGGTCGGGCTCACCTCGCGGAGGCCCTCGCGGCCTACGAGGAGGCGCTCAGGCTGCAGAGGATGTGTAGGGAGCTCCGGAACAAGCAGTCTCTGTCCCGCCTGATGAGCGTGAGTCGGGAGCGAGCCGCGGGGCTGGTGGCGCCACCCGCTGACGACTTCGACGACCTCATCGACGCTCCGCACATAGCCCCATCGTCCAAGTTCCAGCTGGCGCTGACTCCGCCCGACTTCTCCCAGCACCGGGTGGAGGACCTGTTCCGGTCCCTGGGACCCATCGCCGTGTTCTCCGCCAAGCGCCACTGGAGCGCCCCGCGTCTAGTGACGCTCCACCGGCACGCGGGCGGGCCTCGCAGGAGGGACGGCGCGGGCGACGACGACTACGTCACCAAACAGAACGGAGTCTACATCAGCAGCTTCGACGGAGAATACCACCGCAGCCGGGCCCGGGAGGCCGGGAGAGCGCCGGCGGACGGGGAGGGGTTCGGCTTCACCGTCCGCGGAGACGCGCCCGTCATGGTGGCCGCCGTGGAGCAGGACTCGCTGGCTGATATGGCGGGCATGCGGCCAGGAGACTTCATAATGAGCGTCGGCGACAGGGACGTGAAGTGGAGCTCGCACGAGGAGGTGGTGCGGCTGACGAGGGCGGCCGGCGACAGACTCACCCTCAGGCTCGCCTCGCCCATGGACCAGGGAGCCAAGTCGTCCCCTAACGGAGGCCGTCAGTCCCGCTCCAACCAGGGCTCCGTGTCCGCGGCCTCCACGTCCTCGGGGTCCACGGCGGCCGCGCGCCCCAGACGAGCTCCCTCCTGGAACCCCTTCAAGAAGAACGGTGCGAGGGACAACAGCCAGCACAGACACGCCAACCTCGTCTACCGATGA

Protein sequence:

>DPOGS213344-PA
MFSVSGSDPRVATCRGRLQTRRCQLNQEINKELRLRAGAENLFKATTNRKLRETVALELSFVNSNLQLLKEQLAELNSSVELYQNDSKECVMPMIPLGLKETKEVDFREPFKDFILEHYSEDAAAYDDAISDFMDMRQATRTPVRSSAGVALLFKYYNQLYYIERRFFPPDRSLGVYFEWFDSLTGVPSCQRTVAFEKACVLFNIAGIYTQIGAKQERFTCSGLDGSVDALLRAAGALRYIHENFTNAPSVDLAADTLLVLGALMTAQARECLFEKLQLQASECRDQLGDQLSLDLAQEAARLADTYRQLYEKMQTEGVINYVPYSWVSLVHVKAEFYRAVSHEYCAAGLLAPEGPPGDRLRDLYQPDGHGEQASCESSLPALGRAHLAEALAAYEEALRLQRMCRELRNKQSLSRLMSVSRERAAGLVAPPADDFDDLIDAPHIAPSSKFQLALTPPDFSQHRVEDLFRSLGPIAVFSAKRHWSAPRLVTLHRHAGGPRRRDGAGDDDYVTKQNGVYISSFDGEYHRSRAREAGRAPADGEGFGFTVRGDAPVMVAAVEQDSLADMAGMRPGDFIMSVGDRDVKWSSHEEVVRLTRAAGDRLTLRLASPMDQGAKSSPNGGRQSRSNQGSVSAASTSSGSTAAARPRRAPSWNPFKKNGARDNSQHRHANLVYR-