DPGLEAN19940 in OGS1.0

New model in OGS2.0DPOGS202241 
Genomic Positionscaffold579:+ 123-11421
See gene structure
CDS Length3666
Paired RNAseq reads  2472
Single RNAseq reads  5717
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA004832 (0.0)
Best Drosophila hit  Autophagy-specific gene 2 (8e-123)
Best Human hitautophagy-related protein 2 homolog B (5e-142)
Best NR hit (blastp)  hypothetical protein TcasGA2_TC016380 [Tribolium castaneum] (0.0)
Best NR hit (blastx)  hypothetical protein TcasGA2_TC016380 [Tribolium castaneum] (0.0)
GeneOntology terms

  
GO:0005575 cellular_component
GO:0003674 molecular_function
GO:0008150 biological_process
InterPro families
  
IPR015412 Autophagy-related, C-terminal
IPR019443 Domain of unknown function FMP27, domain-6, C-terminal
Orthology groupMCL11586

Nucleotide sequence:

ATGAGGGAATTCACAAACAGCTCCGTAGAACATTCCGCTATACATCTAGACTTCAGTCTG
CCCATCCTCAGCTTACAACTGGAATCAAAGCAGCTGTACGAGATCCTGTACAACCGCATT
AGTTCCGAGCTGCTGCTGTGGTCTCCTCGTGAAGAGTTTGATATCGCCCCCCCTCCGCCA
CCCTCCTTCGAACCATGCCGTGGAGGATACGATTCAGACTCGGAGAGCTCGTCGTCTCAA
GAGGACAATTTATACTATTCAACATATGACAACAAATTGAAGAAAGGCATCGGCAACACA
AGACCGTTCTCAGACATCAGACAATGTGAGACGCACAACTTCTGTCTCACATTCAATGTT
GACAAGGGGCTTCTCTCTATATTAGCGCCTGTGAGGGATAGCAACAAAAGAGTTGTCCCG
GGACAGATGGGTGAATTAGTTTTGGAGGCGCATAAGCTGTCTATGTGTCAAGTCAGCGGG
CTCTATGGAAAAGCTAAGACGGCCCAAATGTGTCTGAGGGCTGCCAAGGCGACTCTATAC
CATGAACCGCTCCTGACTATACCGTCAGACAGGCCACCGTTACGTTTGTACGGCTCAGTG
TTGCCATCACACCTCAAGAAGACAATATATCCGTCGAATAAAGGCGTCATAATAAAAGAT
AGATTGAAGCCTAAGGACATGTTCACAATGGCGCTGAAAACTGAACCTGACACTGAGACG
CCCAATTTGAAGACAATATGCATAGCCCTGGGCATTGAACAGGCCACCCTCCGACACAGA
GGCGACAAGGGTATAGCGTGGCTCAGTCAGCTGTTGGATGTACTAGATGTTATAGACTAC
CCTGTGCCGGGATACACGCCGTCGCCAGTACTATCAGAATTGCATGTGCATGTGTGGGAC
TGCGCTGTGGACTACAGGCCGCTGTATCTTCCAATACGTAGCGTGGTGACGCTTGGCAAC
TTCAGCGTTTCCAGCAACCTTATACCGGAAACCAACACGTCCTACCTCCGCTTCCTCGCT
CAAGAATGCTCACTCCACCTCAGCTATCTCCACAGCAAGACTGTAGCGCCAGACGACAGA
GCACCAGATCTCCACAAGGAATACGTCTGCGTCATTGATGTCGGACTGTTTGAACTGTCC
CTTAGAATGGAAGATAAAAGCAATGGCAGCCAAGACCATCCTCAGGTGGACCTGACGGCG
TCCAACAACATGGTGACTATGTTCACGTGTTGGGATTCCGCGTCCGCGCTGTGCCGTCTG
TTGACTTATGTGGCGTCTGACGGGGACTCGCAGACTTACGACTCCCGACACACCAGCCTG
TGCTCTGACCAGCCCTTGGAACAGTTGGTTGGGTTAGAAGATCGACCGATAGAAGAAATA
AGAGAACTGTCGCCGAGTGAAATCCAACAAGTGAACGATTTGATGGCGGAAGCTATGAAA
GAGAGTCCCAATAATACAATTGATGATGAGGATTTCGTGAGCTCGACGGAAAAGGAAGGT
GTGGAACTGTTCTACTTTCCTGATGAGTCAAATGTGAAGCAAAAGCAACTCGAGACAGCG
GACGCCGAGAGCGAAACTAAGTCAGTTGAATACGAAGACATGTCACACGTTGAAGAGGCC
CAGGAGGCGACGCCGACCAACATGCAGGTCGCCAGGGATCTAGGGGACCCGACTGTCACG
CCGAAGTCGACGCCAAAAAAATCAAAGCGGAAAAAGATGAGCTCGTGCGGCAGCGGCAGT
AACACGGACGACGAGTACTGTGTGGTGGAACAGCTGGCTGGTGACATGGAGATGGAGGAG
CCGGTGGTGACCTGGCTGGCTGGACCCGTCACTATGTTGAACGACCACTTCAGTGTACCA
CCAGCGAAGTCAGACGTACTCGCAGCGCCCAAGAGCTTCCCGCCACCAGTGCTCAGGTAC
ACTCTGTGTGAACTGAGCTTAACCTGGAATATGTTTGGAGGCAGTGATTTCAAACCGAAA
GAAACGTCCAAGAAATCAGTCTCCATTGATGATCCTAGGGGAGGGGGCTCGCCTGTTAGT
TCTGCGCGCAGCAAGGACTACGAGCCATACGAGAGCCGTCGCTCGTTGGCGTCCTCATAC
CGGCACGGGGTCAGTTGGAGCGCGGGAACTGACCGGGTGCGGGCGACTCACACAAGAAAA
AACGACTCCCGGGATCATCACACTTGTGTCAAGCTCTGTCTTACTAAGGTGAAGTTCCAA
CACGAGGTGTACCCGCCCGGATGCACGCAGGCTTCCAGACAGACCCTGGCTATCGCAAAA
ATAGAAGTCTTAGACAGATTAGTGTGCAGCGACATCAACAAACTGCTGAGTCAATATAAA
CTTAAAGACGAACCCGAGAGAAAAAACGCTCATATGTTAATAGTGAAAGCGGTCCACCTG
CGAGCCGACGCCTCGCTCCCGGTGCAGGAGTGCTGTCTAAAGGTGTCTCTACTACCGCTA
CAATTCAACCTGGACCAGGACACTCTCGCCTTTTTAGTTGATTTCTTCTCTAAATTGGGC
AGTGATGAGACCAATGAGGAAGACACAAAGAGCCTAGGGGCTGTCTCAACGGAGTCAGGA
TCCCGTCAAAGTACGCCCACACATAGGCCGCCCGTGATGAGCGTGGGTGCCCATTTAAAA
GACCCACCGCCCACGCCCACATCCTTAGGAGATGCCGACTGTCTCTCGCTTAACGAAACT
GTTATTCGTGACGACGAACCGCTCATGGAGACGTATGAAGCTGAACGGCTGGTGTCCGAG
AATCTCATACAACTGGAGGAGGACTTTCAGCGGCTCGGCATCAGCCACGAGAAGCCGACC
ACCAAAGTGCAAGACTGTGAACCCGTCGATGACTCGCCTATATACTTCCGTCGTGTAGTA
TTTTCTCCTGAGGTGCCAATACGTCTGGACTATGTGGGTAAGCGTGTAGACCTGTCAGCT
GGTCCTGTGGCCGGACTGCTCATGGGACTCGGACAGCTAAACTGCTCAGAGCTAACATTG
AAAAGGCTCGATTATAAGTTGGGCCTGTTGGGCCTTGAGAAGCTGGTGCAATGGGCGCTA
CACGAATGGCTATCAGACATCAAAAGACATCAACTGCCGGGGCTACTCAGTGGCATTGGG
CCCATGCATTCCTTACTACAGATAATCACCGGCATCCGCGACCTGGTCTGGTTGCCGGTG
GAGCAGTGGCGTCGCGACGGGCGTCTGGTCCACGGTCTAAGACGCGGCGCCGCCTCCTTC
ACAGCTAGAACTGCTGTCGCTGCTCTGGACATCACCGCACGCATCCTACATCTCATACAG
GCGACAGCTGAAACGGCGGTGGACATGTTGACACCGGCTCCGGCTCTGCCCCTGTCGACC
CAGGGGAGGAGACGTCGCAGAGACCGCACTAGACAACCCGCTGATATACGGGAGGGAGTT
ACCAGCGCATATAACACTGTTAAAGAGGGTTTCGCGGAGACGGCCGCATCATTATCAGCG
GCGGCTCGTCGGGGGAAGGGCGCGGGGGTGCTCCGTCAGTTGCCGGGGGCTGCGGTCGCG
CCCCTCGCCCTGGCCGCGGCCGGCGCCGCCGACGTCCTGGGAGGTGTCCGAGCACACCTC
GCACCGCACACCACGCGTGATCACGCAGACAAATGGCGCAGACCATTCACAGATACGACT
GATTAA

Protein sequence:

MREFTNSSVEHSAIHLDFSLPILSLQLESKQLYEILYNRISSELLLWSPREEFDIAPPPP
PSFEPCRGGYDSDSESSSSQEDNLYYSTYDNKLKKGIGNTRPFSDIRQCETHNFCLTFNV
DKGLLSILAPVRDSNKRVVPGQMGELVLEAHKLSMCQVSGLYGKAKTAQMCLRAAKATLY
HEPLLTIPSDRPPLRLYGSVLPSHLKKTIYPSNKGVIIKDRLKPKDMFTMALKTEPDTET
PNLKTICIALGIEQATLRHRGDKGIAWLSQLLDVLDVIDYPVPGYTPSPVLSELHVHVWD
CAVDYRPLYLPIRSVVTLGNFSVSSNLIPETNTSYLRFLAQECSLHLSYLHSKTVAPDDR
APDLHKEYVCVIDVGLFELSLRMEDKSNGSQDHPQVDLTASNNMVTMFTCWDSASALCRL
LTYVASDGDSQTYDSRHTSLCSDQPLEQLVGLEDRPIEEIRELSPSEIQQVNDLMAEAMK
ESPNNTIDDEDFVSSTEKEGVELFYFPDESNVKQKQLETADAESETKSVEYEDMSHVEEA
QEATPTNMQVARDLGDPTVTPKSTPKKSKRKKMSSCGSGSNTDDEYCVVEQLAGDMEMEE
PVVTWLAGPVTMLNDHFSVPPAKSDVLAAPKSFPPPVLRYTLCELSLTWNMFGGSDFKPK
ETSKKSVSIDDPRGGGSPVSSARSKDYEPYESRRSLASSYRHGVSWSAGTDRVRATHTRK
NDSRDHHTCVKLCLTKVKFQHEVYPPGCTQASRQTLAIAKIEVLDRLVCSDINKLLSQYK
LKDEPERKNAHMLIVKAVHLRADASLPVQECCLKVSLLPLQFNLDQDTLAFLVDFFSKLG
SDETNEEDTKSLGAVSTESGSRQSTPTHRPPVMSVGAHLKDPPPTPTSLGDADCLSLNET
VIRDDEPLMETYEAERLVSENLIQLEEDFQRLGISHEKPTTKVQDCEPVDDSPIYFRRVV
FSPEVPIRLDYVGKRVDLSAGPVAGLLMGLGQLNCSELTLKRLDYKLGLLGLEKLVQWAL
HEWLSDIKRHQLPGLLSGIGPMHSLLQIITGIRDLVWLPVEQWRRDGRLVHGLRRGAASF
TARTAVAALDITARILHLIQATAETAVDMLTPAPALPLSTQGRRRRRDRTRQPADIREGV
TSAYNTVKEGFAETAASLSAAARRGKGAGVLRQLPGAAVAPLALAAAGAADVLGGVRAHL
APHTTRDHADKWRRPFTDTTD