New model in OGS2.0 | DPOGS209889  |
---|---|
Genomic Position | scaffold82:- 81377-84688 |
See gene structure | |
CDS Length | 3312 |
Paired RNAseq reads   | 1410 |
Single RNAseq reads   | 3902 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA000187 (0.0) |
Best Drosophila hit   | gryzun, isoform B (7e-141) |
Best Human hit | foie gras isoform a (0.0) |
Best NR hit (blastp)   | PREDICTED: similar to FLJ12716-like protein [Tribolium castaneum] (0.0) |
Best NR hit (blastx)   | PREDICTED: similar to FLJ12716-like protein [Tribolium castaneum] (0.0) |
GeneOntology terms    | GO:0005575 cellular_component GO:0003674 molecular_function GO:0008150 biological_process |
InterPro families    | IPR021773 Foie gras liver health family 1 IPR012880 Domain of unknown function DUF1683, C-terminal |
Orthology group | MCL13444 |
Nucleotide sequence:
ATGGCGACTCAGCCGAGTGACAACACAGAATTTCCGCCTGAAATCATTCTTAAGCCCCTT
GCCTTGATAGGGTTGTCAGGGCTTGATACGGTGAATAATGCAATCCACAAAGCTATATGG
GATGCCTTCTCAAACAACCGCCGACCGGACCGAGCCGCCGTGAGGTTCAAGTTGTTGAAT
AACACGTTCGAATTTCCAGTGGTGAAGCCTAAGAGAAACTCCTATGAATGGTACATCCCC
AAAGGTATATTGAAGAAAAACTGGATAACTAAACGTGTATCGTTAATTCCGGCCGTGGTG
GTTATTTTTTATGATATGGAGTGGAATGATCCTCAGTGGAACGAAAAGATCATCGAGTGT
GCGTCGAGAGTGCAGTCGATACGGGCAGCGGTGGAGGGACACGCTACACGTGTCGCTGTT
GTGGTTGTACAGAGTGGACTTTCACCCCCACCATCGGAGTACATGCTCGGTGCTGAAAGA
GCACAGGCACTTTGCTCAGCATGTGAAATACAATCTAAGTCTCTCTTTGTACTCCCTCAC
AGCGATCACCTCATGGGTTACATTATAAGACTAGAAAATGCTTTTTATGATATTGCACAA
AATTATTATCATCACGAAACCAAGAACATCAAGCAGCATAGAGATCATCTGAATAAGACT
ACCCATCAGTATTTGTTTGTTAGACATCAGTTCAAACTAGGCTTCCTCAATGAACTCAAG
CAAGACATAAGCACGGCCCACAAACACTACATGCATGCATATAACAACCTCCTTGATACC
AGACAAGTAGATACTAATGTACATGAAATACGAACCGTGTCTGGTTACATCAATTATAAA
CTATGTAAGCTGCTGTTTGCCTTAAATTTGCCACGAGATGCAATTGCACAAGTTAAGTCA
CATATAGAGCGCTACAAAAACAGAATTGGACCCACTGAACTGTTGTTTGAGCATTATGGC
TGGATTGCCAGGCAGTATAGTGCCTTTGGAGAATTATTTGATGAAGCTATAAGGTTAGGG
CTTCCGGCAATTCAATCCCAACATCCTGGCTTTTATTACCAGTATGCAGCTCAATTTACA
GTGAAAAGACGGCAAGCCATGAGGTCGGTATGCTGTGATGCTTCACACTATCCACCTGCC
CCAGACCCCATGGAGGGTATTGTGGAGTTTTATGGCCAGAGACCTTGGAGACCGGGACGA
CTCAGTGCGGATCCACATGATCCACAAAAGGAACAAGCGGCGGTGTTGGCACTGCAATAC
AATGAAAGAATTTTCAACCATTCTGCTATGATAATTAGTTTTCTAGGTAGTGCTATCTCC
CAGTTTAAAACATTTCACTCGCCCAGAATGAGGAAGCAGTTAGTGGTTGAGATGGCTAAT
GAATATTATTTTTGTGCGGACTATGGTAAAGCTTTGACTTTATTGTCTCATATGCTCTGG
GATTATAGAAAAGAAAAGTGGTGGTTTTTGGCTTCCCATGTCTTAAACCGAGCTTTACAA
TGTGCCTACTTGTCTGCAAAAATTCAAGACTACATTCATTTATCAGTGGAGGCACTCTCC
AAATACATTCAAGTGCCAAACAACGACAAAGATAGAATATTTAGAAACATAATGGCAGTT
CTCAACATGAACATTCCATCACCAGAGCCGAACCTCCCTCCTTCCTCACAGAGTAAAGCA
TTAGAAATGTGGCAACTGGCTATAGACAAAGAGCCTCTCACCATTGCCATAGATATGATA
AACATAGCCAGTTTCCTAGAAGTAAAAGCAAAGTTCAAGCAACAGAAATATAGGATGGAT
GATACAATTGAAGTTGAGTTGTTTGTTAGACTTACATATAACACAACCCTTGATGTTAAA
AGTGCCTCTATGACAATTGCAACAAATACAGAAACTATTGACATAAATATAACGGATGAA
GGCAGTACTACACTGAAACTGATCAGAGGAGAAGTTAAAAGGTTTCTGTGTCAATTTAAA
GCCAGTCCACATGATAATGGATCGGAAATGAAAATCAAAAATGTATCATTTGTATTGGAC
AGTGACAGGAGAAAAATTATAATGAACTTTAAAATCGATGAAATCAAGAATGTAGAGCCC
ACAGTCCATCCTGAATTACTACACTTCATAATGAGTCCTAAAAGTGACTATGAATTTGAT
TGTATAATGCCTTTGACCACTACATCCATCACCAGCAGGGAATGTAGACTGTCTTTAGAT
ATTAAAAATGCAGTGCCGGCTTTACAAGGCGAGTGGTTTCCCACCACTTTCACAGTAATA
AACCATGAAGACGGTCCCGTTCATGATATGTCAATAGTGCTGACACTTCTAAGCTCTCCT
GATAATCCAAACCCTGAATCGGTCACAGAGTTGGGCTTTAGACACGGTGAACCCGAAGCC
CAACCCATTAAACTCTGTGTCGGAGATGTGAATAAAAGTTCTTCATATTCAAACACATTT
TATTTAAAAACTAACAGAACAGCCACAACAACTGTTCAAATAAAAGTAACGTACACAGTA
GATGCTTATGAAACACCTCAACTTGAATGTTCCAAAGAATTCACAACGAAAATCACAGTG
ATCAAACCGTTTGATGTATCAACCAGTTTCGTGTCCATGAACTTTAAGCCTATAACGAAA
TGCTATGTAGATGATCCCTTTATAGTTATGCCTCAAATAAAAATTTTAAGTCCCTGGAAT
TTAGTTATTTTAGATACAGAACTAGAAACGGTAGAAAGCTTTAGATATGCTGATGAGAAA
AAACCTCAATCATGTATAAGTAACCTACGAGTGGCTGAGAAGAATGTGGCCTCTGATGCT
ATATGTATACAGGCTAACTACAAGCCAAAGGAAGTCGCTACGAGAGTAGGCTTGTACAAC
ATCTCTTGGCGTAGAGAGAGCAACACAGATGGCCATTGTGTTATGAGCACTACTGCCCTC
TCGGCACTTCCAATAGATGATTGCCCAATTACTGTTGAAGTCAATTATCCAGAGGTTGTT
GACCTCCAAACATCCGTGCCATTAAAATGTACTCTAATTGGGAAAACTAATACTCCTATC
AGACTGAGTCTCTCCGTGGAAGGCACAGATGCATATATGTTTTCAGGGTACAAAAAGTTC
TCCATCACTGTACCACCCAGAGATAAGGTCGAGTTATGTTACAACATTCACCCCCTGGTG
GCCGGGAACACAATCCCTCCTCGGTTAAAAGCAACAGTTCTTGGTGACACGTCTAGACAA
GAGGTTGTAAAAGAAATGTTTGACAAAATCTTTCCTCAAAATATTTTTGTTATGCCTAAA
TATAATAAATAA
Protein sequence:
MATQPSDNTEFPPEIILKPLALIGLSGLDTVNNAIHKAIWDAFSNNRRPDRAAVRFKLLN
NTFEFPVVKPKRNSYEWYIPKGILKKNWITKRVSLIPAVVVIFYDMEWNDPQWNEKIIEC
ASRVQSIRAAVEGHATRVAVVVVQSGLSPPPSEYMLGAERAQALCSACEIQSKSLFVLPH
SDHLMGYIIRLENAFYDIAQNYYHHETKNIKQHRDHLNKTTHQYLFVRHQFKLGFLNELK
QDISTAHKHYMHAYNNLLDTRQVDTNVHEIRTVSGYINYKLCKLLFALNLPRDAIAQVKS
HIERYKNRIGPTELLFEHYGWIARQYSAFGELFDEAIRLGLPAIQSQHPGFYYQYAAQFT
VKRRQAMRSVCCDASHYPPAPDPMEGIVEFYGQRPWRPGRLSADPHDPQKEQAAVLALQY
NERIFNHSAMIISFLGSAISQFKTFHSPRMRKQLVVEMANEYYFCADYGKALTLLSHMLW
DYRKEKWWFLASHVLNRALQCAYLSAKIQDYIHLSVEALSKYIQVPNNDKDRIFRNIMAV
LNMNIPSPEPNLPPSSQSKALEMWQLAIDKEPLTIAIDMINIASFLEVKAKFKQQKYRMD
DTIEVELFVRLTYNTTLDVKSASMTIATNTETIDINITDEGSTTLKLIRGEVKRFLCQFK
ASPHDNGSEMKIKNVSFVLDSDRRKIIMNFKIDEIKNVEPTVHPELLHFIMSPKSDYEFD
CIMPLTTTSITSRECRLSLDIKNAVPALQGEWFPTTFTVINHEDGPVHDMSIVLTLLSSP
DNPNPESVTELGFRHGEPEAQPIKLCVGDVNKSSSYSNTFYLKTNRTATTTVQIKVTYTV
DAYETPQLECSKEFTTKITVIKPFDVSTSFVSMNFKPITKCYVDDPFIVMPQIKILSPWN
LVILDTELETVESFRYADEKKPQSCISNLRVAEKNVASDAICIQANYKPKEVATRVGLYN
ISWRRESNTDGHCVMSTTALSALPIDDCPITVEVNYPEVVDLQTSVPLKCTLIGKTNTPI
RLSLSVEGTDAYMFSGYKKFSITVPPRDKVELCYNIHPLVAGNTIPPRLKATVLGDTSRQ
EVVKEMFDKIFPQNIFVMPKYNK