New model in OGS2.0 | DPOGS210902  |
---|---|
Genomic Position | scaffold141:- 3373-5175 |
See gene structure | |
CDS Length | 1722 |
Paired RNAseq reads   | 336 |
Single RNAseq reads   | 1619 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA003086 (8e-08) |
Best Drosophila hit   | tartan (5e-93) |
Best Human hit | leucine-rich repeats and immunoglobulin-like domains protein 2 precursor (8e-28) |
Best NR hit (blastp)   | PREDICTED: similar to leucine-rich repeat-containing protein 4B [Tribolium castaneum] (7e-146) |
Best NR hit (blastx)   | PREDICTED: similar to leucine-rich repeat-containing protein 4B [Tribolium castaneum] (9e-146) |
GeneOntology terms    | GO:0005886 plasma membrane GO:0007155 cell adhesion GO:0007424 open tracheal system development GO:0016477 cell migration GO:0005515 protein binding GO:0035147 branch fusion, open tracheal system GO:0008045 motor axon guidance |
InterPro families    | IPR001611 Leucine-rich repeat IPR003591 Leucine-rich repeat, typical subtype IPR000483 Cysteine-rich flanking region, C-terminal domain |
Orthology group | MCL10295 |
Nucleotide sequence:
ATGCAGTTTTACACGGAACTTCAACACTTGGACCTGTCTCAGAATCATCTCGTCAGCATA
CCAATGAAAAACTTTGCATATCAACGAAAGTTACAAGAACTCCATCTTAACCATAACAAA
ATATCTTCAGTCACAAACACGACATTCCAAGGACTCAATTCATTGACCGTTCTCAACCTG
AAACGTAACTTTTTGGAAGAACTTACAAATGGTGTATTTTCTACACTGCCGAGACTAGAA
GAATTGAACTTAGGACAAAATAGAATATCAAAAATAGAGCCGAGAGCATTCGCTGGATTG
TCTGCTTTGAGAATTCTTTATTTGGATGACAACGAGTTGAGTTCGGTCCCAACAACATCC
TTTAGTCTTCTAGGCAGTCTCGCCGAGTTACACGTTGGCCTTAACGCTTTTTCTTTTTTA
CCTGATGATGCTTTCGCGGGTCTCAATAGGCTGGCAGTATTGGACCTTAATGGAGCTGGA
CTCTTTAATATAAGCGACTTTGCATTTAGGGGTCTCCCAGGATTAAGAAGCCTAAACCTT
TTTGGGAACCGATTGAGTGTGGTTCCTACGCAACAGCTTTCTAGCTTGACGAGACTCGAA
GAGTTATATATAGGCCAAAACGACTTTATCGTTTTAGAAAGTCACTCATTTAAAGGATTA
AAAAATCTTAAACTTATAGACATAACGGGAGCGACTCAACTTAAACGAATAGAAAAAGGC
GCTTTCGAAGATAATATCAACTTGGAATCTATTGTATTAACAAATAATAAAGAATTGTCC
ACCATAGAAGATTGTACTCTTCTAGGCTTGCCTAAATTACGACATGTATCATTGAGAGAT
AATGCCATAAAAGTGCTCAGTGAGAGCGTATTTGTAGGAAAAGAATTGAAGCAACTCGAT
TTAACAGACAATCCAATCATTTGCAACTGCAAAATTCTATGGTTACAGCAATTATTAAAT
GAGAAGAGCAATTTTTCTCAAGTGCAATGTGCCAGTCCAGAAAATTTAAAAGACAAATAT
TTAAAAACATTGACCGCCGAGGACTTGGAATGTGTTTTATACGATAGTCGACGGCAAACA
ATTATATGTATTGTAGGATTCGCGTGTCTCGCTGTTGTTGCAACACTGTTACTAATATTA
TACAGATATCGGAAGAGCATGCAGGAGAAACTCAAGGATTATAAGTGGAATAAGGGTCGT
AAGAATTTAGAATACCACAAACCCATTTCCACGGAGGAGGACTGCATCCTAGAGCTTTAC
GCCTCCCCGAGCGTTTTCTTCATGTCAGGCGGCCAGGGCTCCGCGCGGCGGCCGCAGCGC
TCGGGAGGCGGCGGCACTATGCCCGCCAACGGTTTCACGTACATCGGAGGCGATGGACGC
CACCATCAACCACAAACCCTCAACAACGGTGCACCAGCGCACCACCTCAACAACGGCTCA
TTGCGTTCCTTGCCAGACAAAAAGAACCGCAATGGCGTCGTCTGTCACCCTGAAAACTTC
CAACGTAATCTCGACACCCGATATTCGAGGAAACAGGAGAATGGTTACATACGTAACTCG
GAAACCATAATAGGTTTTCCTCGGGACCGGGAGAGGGAGCATGACTACGAGCGGGATGTC
CCCGACTACAGCGAGCCAGAGTACTCCATCATCCCTGAGAGCTACGGCCGACCGGAGGAC
TTCCCTCGCTCGTGCAGCCGCTCCAACACCTTCAACTGTTGA
Protein sequence:
MQFYTELQHLDLSQNHLVSIPMKNFAYQRKLQELHLNHNKISSVTNTTFQGLNSLTVLNL
KRNFLEELTNGVFSTLPRLEELNLGQNRISKIEPRAFAGLSALRILYLDDNELSSVPTTS
FSLLGSLAELHVGLNAFSFLPDDAFAGLNRLAVLDLNGAGLFNISDFAFRGLPGLRSLNL
FGNRLSVVPTQQLSSLTRLEELYIGQNDFIVLESHSFKGLKNLKLIDITGATQLKRIEKG
AFEDNINLESIVLTNNKELSTIEDCTLLGLPKLRHVSLRDNAIKVLSESVFVGKELKQLD
LTDNPIICNCKILWLQQLLNEKSNFSQVQCASPENLKDKYLKTLTAEDLECVLYDSRRQT
IICIVGFACLAVVATLLLILYRYRKSMQEKLKDYKWNKGRKNLEYHKPISTEEDCILELY
ASPSVFFMSGGQGSARRPQRSGGGGTMPANGFTYIGGDGRHHQPQTLNNGAPAHHLNNGS
LRSLPDKKNRNGVVCHPENFQRNLDTRYSRKQENGYIRNSETIIGFPRDREREHDYERDV
PDYSEPEYSIIPESYGRPEDFPRSCSRSNTFNC