DPGLEAN00234 in OGS1.0

New model in OGS2.0DPOGS207706 
Genomic Positionscaffold1416:+ 8531-15909
See gene structure
CDS Length2427
Paired RNAseq reads  495
Single RNAseq reads  1265
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA009939 (5e-46)
Best Drosophila hit  Poly-glutamine tract binding protein 1 (8e-88)
Best Human hitkin of IRRE-like protein 3 isoform 1 (5e-22)
Best NR hit (blastp)  hypothetical protein TcasGA2_TC003365 [Tribolium castaneum] (2e-101)
Best NR hit (blastx)  GK11515 [Drosophila willistoni] (2e-91)
GeneOntology terms  GO:0008267 poly-glutamine tract binding
InterPro families




  
IPR003599 Immunoglobulin subtype
IPR003598 Immunoglobulin subtype 2
IPR013783 Immunoglobulin-like fold
IPR013098 Immunoglobulin I-set
IPR013162 CD80-like, immunoglobulin C2-set
IPR007110 Immunoglobulin-like
Orthology groupMCL12832

Nucleotide sequence:

ATGGGGGGTGACTGCTCACTGTGGGTGAGAGCGGCGACTTTGCAGCTGGACGATGGACAA
TGGCAGTGTCAGGTTACAGCCAGCAACTATGATGTGCAGGATGCTTTATCGAGTCCACCA
GCGGCGCTTGCAGTCAGGGTGCCCCCACAGACTCCAAGGATCCTGTATAATAGCTCTCAT
ATCATGCCTGGACAGAACATCACAGTCCCTGCCGGAGGTCGGGCAACTGTCGTTTGCGAA
GCCAGATACGGCAACCCACCCGCTTATATAGAATGGTATTTGGAGAAGGAACGCCTAACA
GCTTGGAGTCAAACCAACTCCTCGGAGGTGGAGCGGCCCAGGGTGTGGGCTGCAAGATCT
GTTCTGGAGCTTGGGGCGACACGATCAGCTCACGGCAGACAGCTGGCTTGTCGGGCGCAC
CATCCATCATACCCTTCCCCGTATTATAGAGACTCTTACACCATGTTGGACGTTACTTTC
GTCCCCGAAGTATCAATCGTTGGAGCTGACTCGAGTTCTCTCACGAGTCTAGAGGAGGGT
TCAAGTGCTCTAACTCTTGAATGCAGAGCCGACGGCAATCCTAGCCCTTATGTGTGGTGG
ACTAAGGACGGACAGGTCATAGCAACGAACGGCCACAAGTTGATAAGGGCACCGGTATCC
AGAAACGATTCTGGTATCTACGGATGTCAAGCGAGAAACTCTTTGGGTACTTCGGATTCT
GTCAAAATAGAAATCGATGTGAAATTTTCTCCTCGAGTCATCTGGATAGGACCAGACACG
GTCGTGGAAGCCAACCTGTTCTCAGAAGTAACCCTGGAGTGTAAGGCTGAAGGCAACCCG
CCACCCTCCTACCAGTGGTATCATAATCCAAACCTGTCATCCATGAGCGGTCATCTTGAT
GACGGATATCCGATATCATCGACACCACAGCTGTTACTCCACAACGTGTCTTACACGCAG
CATGGAAGGTACACGTGTATAGCGACCAACCATATTGGACTCGAAGAAAGGAGTCACCAA
TCGGAGGCGATCACATTGAACGTACTTGGACCGCCAGTTGGCGCTGAGACAGGTGTGTCG
CACGCGTGGTCGGGGGGCGAAGCTCGTGTTCAGGCCGCTGTGTGTGCGGACCCTCCACCG
AGGAGAGCCGCCTGGCTGTGGGGGAGTCTCAGACTGGACGTGCCATCTCATGTTGGTCGT
TACAAAGCCCTGGAACCCAGCGACGTGGACGGCTGCTATCGCTACACTCTCTTGGTCAGC
GGCACTGGTGCAGCGGACGCAAGAGTTTACGTGCTGCATGTTGAAAACGAAAGAGGATTC
TCCTCACACTCTGTGTCGCTTACTGTACACGATAATTTTTCGCTAACAGAGACAGCGCAC
GTGGCTCCGCTGATCGCCGCAGCGCTGCTCGTTGCTGCCCTACTTGTGCTGGTGCTGTGT
CTGCTGATTCGCTGCCGTCGGAGAAGAGAGAGCGTCGAATATAAGACTGATGACTTAGAT
AGCGAGAAGACCGTCCTGCCAGCTGATGCTGTGTACTCCCCCCGCGACCCTCCTCGCGCT
CCCCCTCCCCGGGACCGGGTGCCAGCGGGAGCGCCCGGCGCGGGGGGCCGGTACTCCCCT
GGCGCGTTGCAGGCAGCGTTGACGTCGTTGGAACGAGAAGCCAAAGAAAACGTTTATGCG
ATACTCGAACCGCATATACAATACGACTTAACTATAAATAAGCACGGGGAACCGCCTAAG
TACTTATCGCTATCAAAGCCTCTAACTCACCAAAGAAAGCACGCTTCTGAACAGCAAAAT
ATTAATGATATTAATTACAATAAATTCATAGACAATAGATTTTCCACATTTAAAAATGAT
CATAAAATACGAAAATCAAATAGAAGTAAACGTAACAATGATAGAAGCTTTGTAAGCAAT
AGTAACAAAGATGTAAAATATTCCATTGATCCCCAATTTGGAATGATTAGAAATGGAAAC
AAAGATGTGTTCAACGCACAACATAGTTCAAGTAGAAACAACACACCTATTAGTTATATT
AATAAGGGACGAATGTACGAAAGGAGTTCCGGAAGGAGTAATATAAGAAAAAATTATGAA
AAAGATGTTGATATGATCAAGATAGACGTTAAAAGAACTTCGAATAGAAGAAAAATAAAA
GACATAGATGTACAAAAGCAGGGAATTCATGAAAATGAAAAGGAATTTTCATTTGAAGAC
TTGCAGGTTGAGCATAAAGTACAAGTCAATGCTGAAGACGACTCGATGCCAAAGCCGGGT
AAAGTGAAAGAAATAGCATCAAGATTTAACAGAAGCGCGGAAAATACAGAAACACCTGTG
ACTGTGAAAGTGAATAGACCAAAGAACGTACAAAGTTTCGACCAAGGTTACTTGGATCAC
GTTTTTCCGGATGCTGTAGAAATTTAG

Protein sequence:

MGGDCSLWVRAATLQLDDGQWQCQVTASNYDVQDALSSPPAALAVRVPPQTPRILYNSSH
IMPGQNITVPAGGRATVVCEARYGNPPAYIEWYLEKERLTAWSQTNSSEVERPRVWAARS
VLELGATRSAHGRQLACRAHHPSYPSPYYRDSYTMLDVTFVPEVSIVGADSSSLTSLEEG
SSALTLECRADGNPSPYVWWTKDGQVIATNGHKLIRAPVSRNDSGIYGCQARNSLGTSDS
VKIEIDVKFSPRVIWIGPDTVVEANLFSEVTLECKAEGNPPPSYQWYHNPNLSSMSGHLD
DGYPISSTPQLLLHNVSYTQHGRYTCIATNHIGLEERSHQSEAITLNVLGPPVGAETGVS
HAWSGGEARVQAAVCADPPPRRAAWLWGSLRLDVPSHVGRYKALEPSDVDGCYRYTLLVS
GTGAADARVYVLHVENERGFSSHSVSLTVHDNFSLTETAHVAPLIAAALLVAALLVLVLC
LLIRCRRRRESVEYKTDDLDSEKTVLPADAVYSPRDPPRAPPPRDRVPAGAPGAGGRYSP
GALQAALTSLEREAKENVYAILEPHIQYDLTINKHGEPPKYLSLSKPLTHQRKHASEQQN
INDINYNKFIDNRFSTFKNDHKIRKSNRSKRNNDRSFVSNSNKDVKYSIDPQFGMIRNGN
KDVFNAQHSSSRNNTPISYINKGRMYERSSGRSNIRKNYEKDVDMIKIDVKRTSNRRKIK
DIDVQKQGIHENEKEFSFEDLQVEHKVQVNAEDDSMPKPGKVKEIASRFNRSAENTETPV
TVKVNRPKNVQSFDQGYLDHVFPDAVEI