Monarch geneset OGS2.0

DPOGS215689
Transcript         DPOGS215689-TA   1620 bp
Protein            DPOGS215689-PA   539 aa
Genomic position   DPSCF300041 - 621707-627495
RNAseq coverage    402x (Rank: top 30%)
Annotation
Heliconius       HMEL004065         0.0      83.12%
Bombyx           BGIBMGA003588-TA   0.0      75.55%
Drosophila       CG5022-PA          1e-128   44.92%
EBI UniRef50     UniRef50_D6W6N9    1e-156   51.97%   Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6W6N9_TRICA
NCBI RefSeq      XP_001850737.1     2e-131   46.06%   band 4.1 [Culex quinquefasciatus]
NCBI nr blastp   gi|270014529       5e-156   51.97%   hypothetical protein TcasGA2_TC004143 [Tribolium castaneum]
NCBI nr blastx   gi|270014529       7e-152   51.79%   hypothetical protein TcasGA2_TC004143 [Tribolium castaneum]
Group
Gene Ontology    GO:0005515   7.9e-25   protein binding
                 GO:0005488   1.5e-13   binding
                 GO:0019898   4.2e-11   extrinsic to membrane
                 GO:0008092   4.2e-11   cytoskeletal protein binding
                 GO:0005737   4.2e-11   cytoplasm
KEGG pathway     rno:313052   1e-47
                 K06107 (EPB41, 4.1R) maps-> Tight junction
InterPro domain  [8-217]     IPR019749   3e-48     Band 4.1 domain
                 [212-302]   IPR011993   7.9e-25   Pleckstrin homology-type
                 [16-94]     IPR018979   3e-21     FERM, N-terminal
                 [89-212]    IPR019748   5.5e-20   FERM central domain
                 [221-305]   IPR018980   8.4e-19   FERM, C-terminal PH-like domain
                 [92-140]    IPR014352   1.5e-13   FERM/acyl-CoA-binding protein, 3-helical bundle
                 [46-58]     IPR019750   1.3e-12   Band 4.1 family
                 [26-45]     IPR000798   4.2e-11   Ezrin/radixin/moesin family
Orthology group  MCL11622   Multiple-copy universal gene
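
The InterPro rows above give each domain hit as a bracketed residue range. A minimal sketch of how those ranges could be parsed and summarised follows. It assumes the ranges are 1-based, inclusive amino-acid positions on DPOGS215689-PA, which the record does not state explicitly. The names `HITS`, `parse_range`, `domain_lengths` and `covered_span` are hypothetical helpers introduced only for illustration.

```python
import re

# Domain hits copied from the InterPro rows of this record: (range, accession).
HITS = [
    ("[8-217]", "IPR019749"),
    ("[212-302]", "IPR011993"),
    ("[16-94]", "IPR018979"),
    ("[89-212]", "IPR019748"),
    ("[221-305]", "IPR018980"),
    ("[92-140]", "IPR014352"),
    ("[46-58]", "IPR019750"),
    ("[26-45]", "IPR000798"),
]

# Protein length as listed in the record (539 aa).
PROTEIN_LENGTH = 539


def parse_range(text):
    """Turn a string such as '[8-217]' into a (start, end) tuple of ints."""
    match = re.fullmatch(r"\[(\d+)-(\d+)\]", text.strip())
    if match is None:
        raise ValueError(f"not a residue range: {text!r}")
    start, end = int(match.group(1)), int(match.group(2))
    if start > end:
        raise ValueError(f"start after end in {text!r}")
    return start, end


def domain_lengths(hits):
    """Length of each hit, assuming 1-based inclusive coordinates."""
    lengths = {}
    for text, accession in hits:
        start, end = parse_range(text)
        lengths[accession] = end - start + 1
    return lengths


def covered_span(hits):
    """Smallest start and largest end over all hits."""
    ranges = [parse_range(text) for text, _ in hits]
    return min(s for s, _ in ranges), max(e for _, e in ranges)


if __name__ == "__main__":
    for accession, length in domain_lengths(HITS).items():
        print(accession, length)
    start, end = covered_span(HITS)
    print(f"hits span residues {start}-{end} of {PROTEIN_LENGTH}")
```

Under that coordinate assumption, all listed hits fall in the N-terminal part of the protein, which is where the FERM / Band 4.1 annotations cluster.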
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215689-TA
ATGTTTAAGAGTCGAGGTGATAGCAATATTGTTTATAAATGTACTGTGAGACTACTAGAAGATACTGAAATATTAGAATGCGAATTTCACCCATCATACAAGGGTAAATTCCTGTTAGAACATGTTTGCCAACAATTAAATCTGACTGAAACTGATTACTTCGGCCTCCGATATGTGGATGGAAATGGTCAAAGGCATTGGCTGCATTTAGCCAAAGTTATATTGAAGCAAGTAAAAGATGCCTCACCCATATTATTTAGTTTTCGGGTAAAGTTTTACCCACCGAACCCTCTACTGCTGAAGGAGGATGTAACGAGATTTCAAATATATCTCCAGTTGAAACGTGACTTACTTCATGGCAGGTTATATTGTACGGCAAACGAAGCTGCTATGCTTGGAGCACTTATAATACAAAGTGAGCCTCTTGACGATTTTAGTATTGAATTACGCTTTATATATTTAGCGTTTCGGTTGTTGCTTCTAAATTTTTATAATATCAAAAGACAGGAGCTACATCGTGAGTCTATAGAGGGCGCTACTGGCGGCGGCGCGTGTACGGACGGCGGAGTCCGAGGTCTGTCCACACAGGAGGCCGAGCAGACCTTTCTACGTTTAGCGTGCACCTTCGACACGTACGGAGTAGACCCTCATCCTGTCAAGGACCATCGTGGTAACCAACTGTACTTGGGCATCAACCACACCGGGATCCTGACATTCCAGGGCAGTCGGAAGACACATCACTTCAAGTGGGCCGAAGTCCACAAGATAAACTTCGAGGGTAGAATGTTTATTGTGCACTTAAATTATCCCGAGAAGAAGCACACAGTCGGTTTCAAATGTCCAAGCGGAGCCGCGTGCCGGCACGTGTGGTGCTGCGCCATAGAACAAATGCTGTTCTTCACGCTGGCGTCATCTTCCGAGGCGTGTGTGTACTCCGGCGGCGGTCTGTTCTCGTGGGGAACTAAGTTCAAGTACGCGGGGAGAACTGAGAGGGAAATATTAGAGAGAGATGGACTTCTGGCTACCACTAGAGACGATCAGGAGGAAGGTTCCAGTGTGGGCGGGAAGAGAAAAGCATCGAGTGTTCCAGCGACCCCGTCCACACCGATGACCGGAGACTTTGGTTACTCAAGCCTGCCCAGGTCGACTCACAGTGCTCCCTTAGAAGAAAGTGCAGTGGACGGTGACCTCTGTGTCGGAGGCTGTTTGTTGGGAGGACCCGCTGTGGACATAGCGCTGACCTGCTGCGATCACCTCAGACACAGCGATGATAAACCACCGTGTCCCGAATACAGCCCGCGCGATCCGTTCGAACACTCGTCGAGTGAGTCAGCGGCGACGACCCACGTGACCTACGTGTCGCAGACCTCTCACGCGGCGGAGGAGTGGCGCCCGCCGGCCCTGCAGCCTCCCCCGCGGCCTTTCAACCTCCTGAGGGCGTTCGTACCTTCCTTCCTATTCGTGTGCCTCTCGCTTTCCCTCGCCATCCTCCTGCTCTTCGAGACCGACTGGCCCCTACTCCGGCCTCTCAAACGCAAACCCGAACTGGTTTCGCTTAGACACCACTACTACGCTCCCCTAAAGGAGTATCTCAAGAAGAAAATCGTCGAGTTATTCTGA

Protein sequence:

>DPOGS215689-PA
MFKSRGDSNIVYKCTVRLLEDTEILECEFHPSYKGKFLLEHVCQQLNLTETDYFGLRYVDGNGQRHWLHLAKVILKQVKDASPILFSFRVKFYPPNPLLLKEDVTRFQIYLQLKRDLLHGRLYCTANEAAMLGALIIQSEPLDDFSIELRFIYLAFRLLLLNFYNIKRQELHRESIEGATGGGACTDGGVRGLSTQEAEQTFLRLACTFDTYGVDPHPVKDHRGNQLYLGINHTGILTFQGSRKTHHFKWAEVHKINFEGRMFIVHLNYPEKKHTVGFKCPSGAACRHVWCCAIEQMLFFTLASSSEACVYSGGGLFSWGTKFKYAGRTEREILERDGLLATTRDDQEEGSSVGGKRKASSVPATPSTPMTGDFGYSSLPRSTHSAPLEESAVDGDLCVGGCLLGGPAVDIALTCCDHLRHSDDKPPCPEYSPRDPFEHSSSESAATTHVTYVSQTSHAAEEWRPPALQPPPRPFNLLRAFVPSFLFVCLSLSLAILLLFETDWPLLRPLKRKPELVSLRHHYYAPLKEYLKKKIVELF-
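
The transcript and protein records above are related by translation: 1620 bp corresponds to 539 codons plus one stop codon, and the protein record writes the stop as `-`. As a minimal sketch, assuming the standard genetic code applies to this nuclear transcript, the following checks that relationship on the first and last codons of DPOGS215689-TA. The function name `translate` and the `stop_symbol` parameter are hypothetical and chosen only for this illustration. The full sequence can be passed in the same way.

```python
# Standard genetic code, laid out in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as `stop_symbol` to match the '-' used
    at the end of the protein record.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for pos in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        residues.append(stop_symbol if amino_acid == "*" else amino_acid)
    return "".join(residues)


# First eight and last five codons copied from the DPOGS215689-TA record.
CDS_START = "ATGTTTAAGAGTCGAGGTGATAGC"
CDS_END = "GTCGAGTTATTCTGA"

if __name__ == "__main__":
    print(translate(CDS_START))  # start of the protein record
    print(translate(CDS_END))    # end of the protein record, including the stop
    # Length check from the record: 539 residues plus one stop codon.
    print((539 + 1) * 3)
```

The two fragments translate to `MFKSRGDS` and `VELF-`, matching the start and end of the DPOGS215689-PA record.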