Monarch geneset OGS2.0

DPOGS202150
TranscriptDPOGS202150-TA1887 bp
ProteinDPOGS202150-PA628 aa
Genomic positionDPSCF300162 - 229812-250554
RNAseq coverage616x (Rank: top 21%)
Annotation
HeliconiusHMEL0108850.093.15% 
BombyxBGIBMGA005255-TA8e-6145.45% 
DrosophilaCdep-PE2e-16477.75% 
EBI UniRef50UniRef50_E2BXI14e-17280.40%FERM, RhoGEF and pleckstrin domain-containing protein 2 n=4 Tax=Coelomata RepID=E2BXI1_HARSA
NCBI RefSeqXP_966992.23e-17579.66%PREDICTED: similar to Cdep CG31536-PE [Tribolium castaneum]
NCBI nr blastpgi|1892371846e-17479.66%PREDICTED: similar to Cdep CG31536-PE [Tribolium castaneum]
NCBI nr blastxgi|1892371842e-16779.66%PREDICTED: similar to Cdep CG31536-PE [Tribolium castaneum]
Group
Gene OntologyGO:00055151.9e-31protein binding
GO:00054882.9e-29binding
GO:00198987.3e-08extrinsic to membrane
GO:00080927.3e-08cytoskeletal protein binding
GO:00057377.3e-08cytoplasm
KEGG pathwaynve:NEMVE_v1g2163853e-88 
 K06082 (FARP2, FRG)maps-> Adherens junction
InterPro domain[51-249] IPR0197494.7e-59Band 4.1 domain
[244-337] IPR0119931.9e-31Pleckstrin homology-type
[138-244] IPR0197482.6e-31FERM central domain
[135-241] IPR0143522.9e-29FERM/acyl-CoA-binding protein, 3-helical bundle
[253-338] IPR0189806.7e-23FERM, C-terminal PH-like domain
[59-136] IPR0189793.9e-19FERM, N-terminal
[88-100] IPR0197505.7e-17Band 4.1 family
[558-593] IPR0148472e-12FERM adjacent (FA)
[68-87] IPR0007987.3e-08Ezrin/radixin/moesin family
Orthology groupMCL10250 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202150-TA
ATGGAAACTGAATTACCGCCCGGCAGCTGTAATTCTACTGGACATCTGTCCACTATTGGATGGCACATGCCAGGTCGTATGCACCACTCGGCGAGCACGCCGGCGGGCGTGGACGGAGGCGCCCGCACGCCGCCCGCCACGCCCAAGAAGGGTGGCAAGATGCTCGCCGTGCGAGTGCAGATGTTGGACGACTCCATATCAATGTTCCAGATACAGTCAAAGGCACACGGTAAAGTTCTGTTCGATCAAGTCTGCAGGCAACTTCATTTGTTGGAGGCTGATTACTTCGGGCTTGAATATCAAGATGCTAACGGGATAAAGTACTGGCTGGATGTGGAGAAGCCGATGTGCCGTCAGGTCGGCCTGTCGATGCTGGAGCCGACCCTCCGCTTCTGCGTCAAGTTCTACACCCCAGACCCGGCGCGGCTCGAGGAGGAGTTCACCAGATACCTGTTCTGTCTCCAAGTGAAACGTGACCTGATGCTGGGGTGCATACAGTGTAACGAGAACACCGCCGCGCTCATGGCCAGTTACATAGTACAAGCGGAGTGCGGAGATTTTGTCCCGGAAGACTACCCTGACCACACGTACCTCAGCGGGTACAAGTTCTTCCCCGGACAGGACGCAGACTCGGAGAGGAGAATCATGGAGAATCATAAGAAACATATCGGTCAGAGCCCGGCGGAGGCTGATTTAAATCTGTTAGAGACAGCGCGCAGGTGCGAATTGTACGGTATAAAGATGCACTCGGCGAAGGATCACGAGGGCGTGCCGTTGAACCTGGCGGTGGCTCACATGGGCATCGCCGTGTTTCAACACTGTACGCGCATCAACACCTTCAGCTGGGCCAAGATCCGGAAGATATCCTTCAAGCGGAAGAGGTTCCTCATTAAGCTGCATCCCGAGGGATATGGCTACTTCCGAGACGTGGTGGAATTTTTCTTCGAGAGTCGAAACGAGTGTAAGAATTTTTGGAAGAAATGCGTAGAGAACCACGGCTTCTTCAGATGTACCAGCGTACCGCGGCTTCCGCGACACAAGACGCGCGTCATGTCAAGAGGATCGTCCTTTAGTCGACTCATTTGTACACAAAAGTTGTACATCATCTGTATAACCTTGAGGAAGGACGACGTTGTATTTAATTTCTTACTTGTACATTACAGGTACAGTGCGGAGTGCGGAGATTTTGTCCCGGAAGACTACCCTGACCACACGTACCTCAGCGGGTACAAGTTCTTCCCCGGACAGGACGCAGACTCGGAGAGGAGAATTATGGAGAATCATAAGAAACATATCGGTCAGAGTCCGGCGGAGGCTGATTTAAATCTGTTAGAGACAGCGCGCAGGTGCGAATTGTATGGTATAAAGATGCACTCAGCGAAGGATCACGAGGGCGTGCCGTTGAACCTGGCGGTGGCTCACATGGGCATCGCCGTGTTTCAACACTGTACGCGCATCAACACCTTCAGCTGGGCCAAGATCCGGAAGATATCCTTCAAGCGGAAGAGGTTCCTCATTAAGCTGCATCCCGAGGGATATGGCTACTTCCGGGACGTAGTGGAATTTTTCTTCGAGAGTCGAAACGAGTGTAAGAATTTTTGGAAGAAATGCGTAGAGAACCACGGCTTCTTCAGATGTACCAGCGTACCGCGGCTTCCGCGACACAAGACGCGCGTCATGTCAAGAGGATCGTCCTTTAGGTACAGCGGTAAAACACAGAAGCAAATAGTGGAATTCGTAAGGGATAACTACGTGAAGCGGCAAACATTCCAAAGGTATTTAGATTCAGCCGGGGCTGATCTACATAATAACGGTGAGTTGGCACTTGGCATGGGGCTGGGGACGAGCCTGATCGTGGAGGGCAATGACCTCCCTAACTTCAGATAA

Protein sequence:

>DPOGS202150-PA
METELPPGSCNSTGHLSTIGWHMPGRMHHSASTPAGVDGGARTPPATPKKGGKMLAVRVQMLDDSISMFQIQSKAHGKVLFDQVCRQLHLLEADYFGLEYQDANGIKYWLDVEKPMCRQVGLSMLEPTLRFCVKFYTPDPARLEEEFTRYLFCLQVKRDLMLGCIQCNENTAALMASYIVQAECGDFVPEDYPDHTYLSGYKFFPGQDADSERRIMENHKKHIGQSPAEADLNLLETARRCELYGIKMHSAKDHEGVPLNLAVAHMGIAVFQHCTRINTFSWAKIRKISFKRKRFLIKLHPEGYGYFRDVVEFFFESRNECKNFWKKCVENHGFFRCTSVPRLPRHKTRVMSRGSSFSRLICTQKLYIICITLRKDDVVFNFLLVHYRYSAECGDFVPEDYPDHTYLSGYKFFPGQDADSERRIMENHKKHIGQSPAEADLNLLETARRCELYGIKMHSAKDHEGVPLNLAVAHMGIAVFQHCTRINTFSWAKIRKISFKRKRFLIKLHPEGYGYFRDVVEFFFESRNECKNFWKKCVENHGFFRCTSVPRLPRHKTRVMSRGSSFRYSGKTQKQIVEFVRDNYVKRQTFQRYLDSAGADLHNNGELALGMGLGTSLIVEGNDLPNFR-