DPGLEAN04841 in OGS1.0

New model in OGS2.0  DPOGS216175
Genomic Position  scaffold1533:+ 49249-62233
CDS Length  1611
Paired RNAseq reads  306
Single RNAseq reads  1017
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA014166 (7e-102)
Best Drosophila hit  CG5149, isoform A (7e-69)
Best Human hit  hypothetical protein LOC57707 (2e-43)
Best NR hit (blastp)  AGAP007765-PA [Anopheles gambiae str. PEST] (9e-127)
Best NR hit (blastx)  AGAP007765-PA [Anopheles gambiae str. PEST] (4e-114)
GeneOntology terms
GO:0005198 structural molecule activity
GO:0005923 tight junction
InterPro families  IPR006571 TLDc
Orthology group  MCL15406

Nucleotide sequence:

ATGGGTAACACTAGCAAAAAGCTGGCAGCAAAATGTGCTCTGCTAACTAAGGAGGAGCAG
AAGTATGTTGCGGCAACATTCAGGGCAGCCAGCAAGAACTCGGAGAGAATAAGGGAAGAG
GACCTCATCAAGTTTTGGGGTCCGCAAATTGATCCGAGGTTGGCTCAATATCTCACCAAT
TTTCTCTTCGGCTGCGGTCAACAGAAAACAGCCACAGTGGATTTTAACAGGTTCGCTGAG
CTCTACGTCTACAATGTTAGAGGCACTGTCGAGGAGAGAATGATGGTGACATATAACTGT
CTAGGTATGGATTACAATGAAGACGCCGAGTTGCCCTATCAACTTTTAAAAGAGTATTGC
GAGAGCATAGTGTCGACGTACATGAAGATAGTTAAGTCTTCGTCGACGAAACGCGCGTCC
ACGTGGTTGGAGAAAGGTTTCAGGGCGAGCGCCTCGCACGTCCAAAGTCTAGGTGAGGCG
GTCGCGGCTACCATCGGGGACTTGGAGACGGCGCAGCATCATTGTACAGCAACCCAACTG
TCTAAATGGTTGCAATCCAACATCCTTCTGAAGCAGCTGGCGGAGCTAGTGTACGTGAAC
CTGTATGGTATTAACAGACGTGGTGGTGACGAGAGCCCCACTCCCATGCCACCAGCCGCG
CCATCTTTGCTGCCGGCAGTTGAAGGCTTGGAGGCAATGCCGGACTACCCCGCATTCATA
GATCTCTCGCACGTCGTGTGGATCAACAGTCATCTGCCGCCGCAGCATCAGCATAAATGG
AGATTCCTCTTCTCGACCAACATACATGGGGAATCCTTCTCCACTATGACCGGTCGTATC
ATCGACCAGGGTCCATCAGTGATCATAGTCGAGGACTCCAGCGGGTATATATTCGGGGGC
TTCGCCACAGCCTCGTGGGCCTTCGGTCCAAACTTCACCGGCACCGACGACTCCTTCCTC
TTCACGTGCGTGCCTAAGATGAGAGTGTACCCGGCGACCAATTACAACGATCACTACCAG
TACCTGAACCATCACACAAAGACCTTGCCCAACGGACTTCTAATGGGTGGTCAGTTTAAT
TTCGGTGGTATCTGGATATCAGCGGAACCGTTCGGTGATGGTGCGTCCGCTGAGTCCTGC
AGCACCTTCCGCGGGTACAGGCGTCTCAGCAAGGAACCGACGTTCAGACTTCGATCACTT
GAAGTTTGGGCCGTTGGTGACAAACCTTTGCTCGATAAGGACGGGGACATGAAGACGTCT
CAGTCCTCCAGCGTCCTAACTACACATAAATCAGAACGCAATCTGCTGGAGATGATCGGA
AAACCTCAAGTCAGCGACGGACTCAGAGATAATTTCGAGGACGAGCCTACCGGGGCTGGC
TTCGACGCACTTTCCAAGAACGAACTGATGGTAGTTCCCGTTCCTCGCTCCCACTTCTAC
CGCGTACTAATTAACGGCGAACCTATTCTTAGTGACTTCGATAGCTCATTGCCCTTGGTG
GATGAGCGTGACGTAAGCATCCTGGATACGAACCCTGAAGCGAAGGCGATACTTGATATG
GCGGGGCGGACGCGTCACAGCGAAGGTCTGAGAGAACAGCCACCGTTATAA

Protein sequence:

MGNTSKKLAAKCALLTKEEQKYVAATFRAASKNSERIREEDLIKFWGPQIDPRLAQYLTN
FLFGCGQQKTATVDFNRFAELYVYNVRGTVEERMMVTYNCLGMDYNEDAELPYQLLKEYC
ESIVSTYMKIVKSSSTKRASTWLEKGFRASASHVQSLGEAVAATIGDLETAQHHCTATQL
SKWLQSNILLKQLAELVYVNLYGINRRGGDESPTPMPPAAPSLLPAVEGLEAMPDYPAFI
DLSHVVWINSHLPPQHQHKWRFLFSTNIHGESFSTMTGRIIDQGPSVIIVEDSSGYIFGG
FATASWAFGPNFTGTDDSFLFTCVPKMRVYPATNYNDHYQYLNHHTKTLPNGLLMGGQFN
FGGIWISAEPFGDGASAESCSTFRGYRRLSKEPTFRLRSLEVWAVGDKPLLDKDGDMKTS
QSSSVLTTHKSERNLLEMIGKPQVSDGLRDNFEDEPTGAGFDALSKNELMVVPVPRSHFY
RVLINGEPILSDFDSSLPLVDERDVSILDTNPEAKAILDMAGRTRHSEGLREQPPL
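
Consistency check (illustrative sketch, not part of the original record): the CDS length, the terminal codons and the protein length listed above can be cross-checked with a few lines of Python. The function name `cds_summary` and its return keys are hypothetical, chosen only for this example. The inputs are taken from this record: CDS length 1611, first codon ATG, last codon TAA. The sketch assumes the CDS ends in a single stop codon that is not counted in the protein.

```python
# Standard stop codons of the universal genetic code.
STOP_CODONS = {"TAA", "TAG", "TGA"}


def cds_summary(cds_length, first_codon, last_codon):
    """Return basic consistency facts for a coding sequence.

    Hypothetical helper for illustration; assumes a complete CDS
    that ends with exactly one stop codon.
    """
    return {
        # A complete CDS should be a whole number of codons.
        "in_frame": cds_length % 3 == 0,
        "starts_with_atg": first_codon.upper() == "ATG",
        "ends_with_stop": last_codon.upper() in STOP_CODONS,
        # Number of codons minus the terminal stop codon.
        "expected_protein_length": cds_length // 3 - 1,
    }


# Values taken from the record above.
summary = cds_summary(1611, "ATG", "TAA")
print(summary)
```

Under these assumptions the expected protein length is 536 residues, which matches the protein sequence listed above (eight lines of 60 residues plus a final line of 56).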