New model in OGS2.0 | DPOGS216175  |
---|---|
Genomic Position | scaffold1533:+ 49249-62233 |
See gene structure | |
CDS Length | 1611 |
Paired RNAseq reads   | 306 |
Single RNAseq reads   | 1017 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA014166 (7e-102) |
Best Drosophila hit   | CG5149, isoform A (7e-69) |
Best Human hit | hypothetical protein LOC57707 (2e-43) |
Best NR hit (blastp)   | AGAP007765-PA [Anopheles gambiae str. PEST] (9e-127) |
Best NR hit (blastx)   | AGAP007765-PA [Anopheles gambiae str. PEST] (4e-114) |
GeneOntology terms    | GO:0005198 structural molecule activity GO:0005923 tight junction |
InterPro families   | IPR006571 TLDc |
Orthology group | MCL15406 |
Nucleotide sequence:
ATGGGTAACACTAGCAAAAAGCTGGCAGCAAAATGTGCTCTGCTAACTAAGGAGGAGCAG
AAGTATGTTGCGGCAACATTCAGGGCAGCCAGCAAGAACTCGGAGAGAATAAGGGAAGAG
GACCTCATCAAGTTTTGGGGTCCGCAAATTGATCCGAGGTTGGCTCAATATCTCACCAAT
TTTCTCTTCGGCTGCGGTCAACAGAAAACAGCCACAGTGGATTTTAACAGGTTCGCTGAG
CTCTACGTCTACAATGTTAGAGGCACTGTCGAGGAGAGAATGATGGTGACATATAACTGT
CTAGGTATGGATTACAATGAAGACGCCGAGTTGCCCTATCAACTTTTAAAAGAGTATTGC
GAGAGCATAGTGTCGACGTACATGAAGATAGTTAAGTCTTCGTCGACGAAACGCGCGTCC
ACGTGGTTGGAGAAAGGTTTCAGGGCGAGCGCCTCGCACGTCCAAAGTCTAGGTGAGGCG
GTCGCGGCTACCATCGGGGACTTGGAGACGGCGCAGCATCATTGTACAGCAACCCAACTG
TCTAAATGGTTGCAATCCAACATCCTTCTGAAGCAGCTGGCGGAGCTAGTGTACGTGAAC
CTGTATGGTATTAACAGACGTGGTGGTGACGAGAGCCCCACTCCCATGCCACCAGCCGCG
CCATCTTTGCTGCCGGCAGTTGAAGGCTTGGAGGCAATGCCGGACTACCCCGCATTCATA
GATCTCTCGCACGTCGTGTGGATCAACAGTCATCTGCCGCCGCAGCATCAGCATAAATGG
AGATTCCTCTTCTCGACCAACATACATGGGGAATCCTTCTCCACTATGACCGGTCGTATC
ATCGACCAGGGTCCATCAGTGATCATAGTCGAGGACTCCAGCGGGTATATATTCGGGGGC
TTCGCCACAGCCTCGTGGGCCTTCGGTCCAAACTTCACCGGCACCGACGACTCCTTCCTC
TTCACGTGCGTGCCTAAGATGAGAGTGTACCCGGCGACCAATTACAACGATCACTACCAG
TACCTGAACCATCACACAAAGACCTTGCCCAACGGACTTCTAATGGGTGGTCAGTTTAAT
TTCGGTGGTATCTGGATATCAGCGGAACCGTTCGGTGATGGTGCGTCCGCTGAGTCCTGC
AGCACCTTCCGCGGGTACAGGCGTCTCAGCAAGGAACCGACGTTCAGACTTCGATCACTT
GAAGTTTGGGCCGTTGGTGACAAACCTTTGCTCGATAAGGACGGGGACATGAAGACGTCT
CAGTCCTCCAGCGTCCTAACTACACATAAATCAGAACGCAATCTGCTGGAGATGATCGGA
AAACCTCAAGTCAGCGACGGACTCAGAGATAATTTCGAGGACGAGCCTACCGGGGCTGGC
TTCGACGCACTTTCCAAGAACGAACTGATGGTAGTTCCCGTTCCTCGCTCCCACTTCTAC
CGCGTACTAATTAACGGCGAACCTATTCTTAGTGACTTCGATAGCTCATTGCCCTTGGTG
GATGAGCGTGACGTAAGCATCCTGGATACGAACCCTGAAGCGAAGGCGATACTTGATATG
GCGGGGCGGACGCGTCACAGCGAAGGTCTGAGAGAACAGCCACCGTTATAA
Protein sequence:
MGNTSKKLAAKCALLTKEEQKYVAATFRAASKNSERIREEDLIKFWGPQIDPRLAQYLTN
FLFGCGQQKTATVDFNRFAELYVYNVRGTVEERMMVTYNCLGMDYNEDAELPYQLLKEYC
ESIVSTYMKIVKSSSTKRASTWLEKGFRASASHVQSLGEAVAATIGDLETAQHHCTATQL
SKWLQSNILLKQLAELVYVNLYGINRRGGDESPTPMPPAAPSLLPAVEGLEAMPDYPAFI
DLSHVVWINSHLPPQHQHKWRFLFSTNIHGESFSTMTGRIIDQGPSVIIVEDSSGYIFGG
FATASWAFGPNFTGTDDSFLFTCVPKMRVYPATNYNDHYQYLNHHTKTLPNGLLMGGQFN
FGGIWISAEPFGDGASAESCSTFRGYRRLSKEPTFRLRSLEVWAVGDKPLLDKDGDMKTS
QSSSVLTTHKSERNLLEMIGKPQVSDGLRDNFEDEPTGAGFDALSKNELMVVPVPRSHFY
RVLINGEPILSDFDSSLPLVDERDVSILDTNPEAKAILDMAGRTRHSEGLREQPPL