New model in OGS2.0 | DPOGS211803  |
---|---|
Genomic Position | scaffold306:+ 98146-100356 |
CDS Length | 2211 |
Paired RNAseq reads   | 510 |
Single RNAseq reads   | 1536 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA006129 (7e-18) |
Best Drosophila hit   | asteroid (7e-67) |
Best Human hit | protein asteroid homolog 1 (1e-30) |
Best NR hit (blastp)   | PREDICTED: similar to asteroid CG4426-PA [Tribolium castaneum] (2e-93) |
Best NR hit (blastx)   | GK24697 [Drosophila willistoni] (1e-79) |
GeneOntology terms    | GO:0001745 compound eye morphogenesis; GO:0008586 imaginal disc-derived wing vein morphogenesis; GO:0001752 compound eye photoreceptor fate commitment; GO:0007173 epidermal growth factor receptor signaling pathway; GO:0003674 molecular_function; GO:0005575 cellular_component; GO:0007526 larval somatic muscle development; GO:0007476 imaginal disc-derived wing morphogenesis |
InterPro families   | IPR006085 XPG N-terminal |
Orthology group | MCL13165 |
Nucleotide sequence:
ATGGGTGTTAGAGGATTGACTACTTACATAAATTATAATCAGAATATGTTTATGAAAAGG
TATTATTTACATGACTCAAAACTTGTAATCGATGGTAACAGTTTATGTGCCCAATTGTAT
AGATCTATGATCAGTTTCTCAGCATTCGGTGGAGATTATGATAAATATGCCTCGCATACA
AAAACCTTTTTTAAAAACTTACGTAAATGTAACATAACACCCTTTATTATATTTGATGGC
TGCCACGAGCACAGAAAACTCAAAACAGTGTTTAGCCGGCTTCGTTCAAAATTAAAAGGA
ACAGCTCGCTTGGATCCCGTAACTCAGGACAGTTTACACATCTTTCCCTATTTACTGAAA
GATGTTTTTACACAGGTTCTCAATGAGCTCCAAATATCATACACTGTATGTGCGTTTGAG
GCAGATGATGAAATAGCAGCCCTGGCTAGATATTTTGATTGCCCTGTACTGAGCTATGAT
TCTGATTTTTTTATTTACAATGTCAAGTATATACCCTTTAACACTTTAGAGGTGAAGGCA
CAATCCATAGAAGAAAAAGATAATAAATTCTTTGCATTGGAATGTAGGCTATTCACAGTG
GAATACTTTTGTAAACATTTTGGTGACCTACAGCCTGAGCTATTACCATTACTGGCCACA
TTGCTCGGAAATGATTATGTAAAGAAGAGAGTCTTCAATAAATTCTTCTCACAATTAAAG
CTGCAAAAAACTAAGAAGAAGAAGAATAGTCAGCAGAGAAATATTCATGCTTTATTTGTA
TGGTTGCAGAATGAAACATTTGATTCGGCTGTTCAAAAAGTTTTAGGGAGACTGAAAAAA
CATCAGAAGAGTAAAGTGTGTGCAATCATTAAACGGAGTGTTGAGGGTTATCACAATAAC
CAGTGTCAGTCATTAAAATATTTCAATTTATCCGAGCATGTTTCTATAAAAAGCAAGGAG
CTCAAAATGCCAGATATTATAGATGAAGATGGTAGTGATTCAGAAAATGAAAAGAGTGAA
ACAGATGTGAGTAGTGATTCAGAAGATTCTACTGAAGAAGAAAGTGTAGAAGAAGCTGTG
GAAGCTATAGAAATTAAGGACCCTGTCAAAACTGGTCCAAACCGGTTACCTGATTGGTTA
GTGGAAAAGATACGACACAACCAAATTCCGAAAATGTTTCTGAATCTATACTATCATCAC
TTCCATATTTGCAACCCACAAGCGGAGGATTTCACTGAAGACGACTCGTTTTTTTGTGTG
TTGCCAGTTCTAAGATACTTGTTTGACTTGCTGACGGACTACTCTCATGACAAATGTTTT
TACATCGGTAGAGAATTTCGTCATTACAGGAAGTTTATAATCGATAGAGATTATTCTGTA
CAGCGGCCATTTGGAATAACGTTCCCTGAACTCGGCGCTGATCAATTGAAGAAATGTTTC
TACCATTTCATTGACATAAAACTACCCAATCTTGATTGGTCTGACATTGAGCTACTGCCG
ACGAATTTCCAACTGTTTGCCATATCGCTTCTTTGGTGGATATCGTACTGTAATGTTCCA
GAATTTCATTTGCACAGTCTAGTTCTGACGTATATAATGTTGAGCGTCATCGATGAACGA
ACAGGCACCAGTCGTGGCGCTTTTCATTTCAATAACAGGCACGCTAAGAAAATCGAAGAA
TTCAGAAATAGAGTAGCAGATGAAGCTAACGATGAATTGTTTCTCAATAAAAATAAGGTT
CTATATGAAGATTGTGTCGTAGCTGCTAGTGTTCTCTTGAAACACTTTGAAATTGACGAA
AGCGTGCGTAAGAGACCTAAAAGTTATGATATTAAGAAAATACACTGTATGGCTCAGTTT
CAAGTTTCTTTGCAAGCTATAAACAGCTTAAACACATTATGCGGCTCGCCATTAGATTGT
ACAACATACTATAGATGTTATAATGGCGTATTTGTATATAACATTGCACTGAAACTGGAA
AATCAGAAGGATCCTATGAATTTCTTACGACAATATCTAAAAGGATCTCATACGGTTTTG
CTATTGTATAAAAGTATATTTTCTGTACTAAACAATTTAATGGAAAAAATGAAGTTATCC
ACTATACCGTGGTCTCCAAAAAAGAAACGTCCGAGGAGGAGAAGGAATGAGCCGGATGAA
AATGATATTTCGTTTGTTGTCAAAGGTTTTGAATCAGATGTCAAAATATAA
Protein sequence:
MGVRGLTTYINYNQNMFMKRYYLHDSKLVIDGNSLCAQLYRSMISFSAFGGDYDKYASHT
KTFFKNLRKCNITPFIIFDGCHEHRKLKTVFSRLRSKLKGTARLDPVTQDSLHIFPYLLK
DVFTQVLNELQISYTVCAFEADDEIAALARYFDCPVLSYDSDFFIYNVKYIPFNTLEVKA
QSIEEKDNKFFALECRLFTVEYFCKHFGDLQPELLPLLATLLGNDYVKKRVFNKFFSQLK
LQKTKKKKNSQQRNIHALFVWLQNETFDSAVQKVLGRLKKHQKSKVCAIIKRSVEGYHNN
QCQSLKYFNLSEHVSIKSKELKMPDIIDEDGSDSENEKSETDVSSDSEDSTEEESVEEAV
EAIEIKDPVKTGPNRLPDWLVEKIRHNQIPKMFLNLYYHHFHICNPQAEDFTEDDSFFCV
LPVLRYLFDLLTDYSHDKCFYIGREFRHYRKFIIDRDYSVQRPFGITFPELGADQLKKCF
YHFIDIKLPNLDWSDIELLPTNFQLFAISLLWWISYCNVPEFHLHSLVLTYIMLSVIDER
TGTSRGAFHFNNRHAKKIEEFRNRVADEANDELFLNKNKVLYEDCVVAASVLLKHFEIDE
SVRKRPKSYDIKKIHCMAQFQVSLQAINSLNTLCGSPLDCTTYYRCYNGVFVYNIALKLE
NQKDPMNFLRQYLKGSHTVLLLYKSIFSVLNNLMEKMKLSTIPWSPKKKRPRRRRNEPDE
NDISFVVKGFESDVKI
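
Consistency check (illustrative): the sketch below shows how the genomic span, the CDS length, and the protein length in this record relate to each other. It assumes the coordinates are 1-based and inclusive, and that the CDS ends with a stop codon that is not translated; the function names `span_length` and `expected_residues` are hypothetical helpers, not part of any annotation tool.

```python
def span_length(start: int, end: int) -> int:
    """Length of a coordinate span, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_residues(cds_length: int, has_stop: bool = True) -> int:
    """Number of amino acids encoded by a CDS of the given length.

    Assumes the CDS is in frame and, if has_stop is True, that the final
    codon is a stop codon that contributes no residue.
    """
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = cds_length // 3
    return codons - 1 if has_stop else codons


# Values taken from the record: scaffold306:+ 98146-100356, CDS Length 2211.
length = span_length(98146, 100356)
residues = expected_residues(length)
print(length, residues)
```

Under these assumptions the genomic span equals the listed CDS length, which is what one would expect for a single-exon model on the plus strand.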