Monarch geneset OGS2.0

DPOGS211108
TranscriptDPOGS211108-TA4746 bp
ProteinDPOGS211108-PA1581 aa
Genomic positionDPSCF300007 - 708856-720617
RNAseq coverage185x (Rank: top 49%)
Annotation
HeliconiusHMEL0124470.058.66% 
BombyxBGIBMGA002983-TA0.048.42% 
Drosophila% 
EBI UniRef50UniRef50_E0VF112e-4941.47%Putative uncharacterized protein n=1 Tax=Pediculus humanus corporis RepID=E0VF11_PEDHC
NCBI RefSeqXP_973331.27e-5439.22%PREDICTED: similar to Activating molecule in BECN1-regulated autophagy protein 1 [Tribolium castaneum]
NCBI nr blastpgi|1892378031e-5239.22%PREDICTED: similar to Activating molecule in BECN1-regulated autophagy protein 1 [Tribolium castaneum]
NCBI nr blastxgi|1892378032e-5428.94%PREDICTED: similar to Activating molecule in BECN1-regulated autophagy protein 1 [Tribolium castaneum]
Group
Gene OntologyGO:00055153.3e-21protein binding
KEGG pathway 
InterPro domain[1435-1579] IPR0159433.3e-21WD40/YVTN repeat-like-containing domain
[133-250] IPR0110463.2e-16WD40 repeat-like-containing domain
[135-175] IPR0016801.1e-07WD40 repeat
[136-175] IPR0197811e-06WD40 repeat, subgroup
Orthology groupMCL26737 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211108-TA
ATGAACGTGCCGCGCATTCTCAAAGGAAACAATCTAGCATTCTACCGCAAGCCGTTTGAGATCGGAATATTCAAGCGATTTCTCACTTTTCTTATTCTTCACCTATCACAATTTAAAAAGGTAATGGATCCAGTTATTGACAATGAATGGCTGGGATGTAAGAATGATCCGATCGATGAACCACCGACATTTGGCAATATAGCACGGAGCTGGCAGTGGCGGGAGAGGGGAGTCAATGCTCCGAACTCACCGTCAAACAAGACAGTACTTGAGAATGTTGCTGAGGATGTACTTGTACAAAAGCCTCTTAAGATACGAGGGGAGTTTCCACTGATAAATGGCAACTGCTTAGTGTTTGGTCGCCTTGTGACCACAGCCTGTGCGACTTGGGCCAAATTAGCTAGTGGGAAACATGTGAGGATTTTGAAAGGTCATCCTCGGACACCGTGGTGTATCGCTTTCCATCCATCCCATCCACAGCTCATAGGATCTGGATGTCTTGCCGGACAAGTGAGAGTGTGGGATATATCGAGTGGTGGCAGTGAAGTGTGGAATGTGAGGAACGAGACTGTCATAGCCTCGATAGCTTTCCATCCAAGGGTACAGTTGTTGGTGATTGCTACGTACAATGAGCTATACTTCTGGGATTGGAGCCAACCAGCACCAATGACTAAGGTCTCCACCAATAACATCAATGAAAAAGTCAGATACGTGGCATTCGATTCTCTCGGCTACAAGTTAATAACCGGCATATCGTTGAGCGCGATAAGGGGTGGTGTAGACGCGACCGTTCTCCAGCACGCTCCATCAAATCCCCAAAACAACAGCAACCGTAGTCTCCACCCCGATGATAATGACGGCAGCAATAGACCAGACACGGCTGGGGCGACACAGGACGTTATAGTGAATTCATATCAGTATCTGGTACAGAGGTACGACAGTCTTGTCAGGAATTATCAGAGGCTGTTCATGGTTAGGAACCGGCTCGCTGTCACACCGCCGCCTAACACGACGGACAGAGGAACTGATCCAATGGAAACAGACCCAGCTCCAGAAGCAAACAGCTATACACGCACGGCTCGCTCCTATACAGCCACAACACAGTTGGATGACGAAAACCAAAACTCGAATACAAGTAATAGTAACACCGCTGAAGACACCGCTGACGCTGCTGAAGAAGGGAGTAGCGGTCTTCTGAATCTTGACGCCCTTTCCTTTGATAGGGCACCTCCTGGAGAGAGAAACATGTCGAGACTGGAACTTGCTATATTCGGTGCGCGTCCGTCAGCTTTTCATCCGCTCACCAACGAAAATCCCGGACGATCTTTCAGAACGGTCGATCAAAATGAAAGAGCCTCAAGACTTTTGCCAAATCCTTTTTCAAACACAAACGGTACACCGCGGCCCAGCCGCGTCTCATTCGCCAGTAGTCCGTTGACCCAACAATCAAACAACGATAACGCGCCAAATGCTTCTCGTACCAGTATTTTCACGTCATACAGACCTAATTCTCCGTCTGTGTTGGATTTATCGTCGCGCAGGTGGAGTGGTAATCGTAGAGTATTTATTCCAAGACCCAGCGACTCACAACCGTCGAATAGCAATTCGTCTACAAGATCCCGTCAGGAAGATGCTTCTTCCAACACAACAAATAGGAACACAGCAACAAATACCAGTGCATCTTCAAATTCGAATTCTATTCCAAATATATCTTTGAGTAATAATTCATTATCCACTGGCAATACGCCTACAGATACAACCCCCTCTACAAATAATTCAACCAGAAGAACGTCATCACCACCTCAAGGCTCGCCAGTTGACGCGCAGACGATTAACGAGAGTTTAGATTTAATACGAGACATTCTCAGAGATTCCGGAACGAGATTGTTAAACTTAATCACAAACATGTCCGCTTCAAGTCTACCTCACGTGGGTCCGGAAGTTCCGAATAGACGCGTCAATATAAGAGCCCATTCGGACGATAGCGGTGAAAGTGAACGACCGAGGGGAGCACGACCACGGGTTCGCCTATTCAGCTCCAACTTGCAGCCCGAAATACTATCTTCGAGTTCAGAATCTGATAGTGACCAGTCCTCAGAGGTAAATATTCTTAGGACTCCGAACACGACGTTCAGAGAAGAAGACTCGCGAGAAGAATCGTCTCGAACTACCGAAATGCCGGGACCGTCTAACAGATCAGCGCCTCCCTCTGATAGAGCAGACTCCGAGAACGATGACAATACACCCAACACTGGCATTGATACTGAGAGTGTTTCTACAGCAGCAAGTGCAAGTAGTAGTCGTAATCGACCGAGAACACAAAATAATGATGCAGGCGAGGGGCCTAGCACATCAAGTCACGCGTCTGACGAACCAGAGAACTTAGCTCAGAACAGTTCAAACTATTTCGATTCTAGTAGACCTTCCACAAGTGGAAACAATATATGGACGAATGGGGCCACAGTGCCACCAGAAGAAGCGTTTAGAAAAGTTCGGGGCGGTGTGAACGCGTTACAGAAACACACAACTCAACTGACTAATATGTGGGTACGCGGGAACCGCACTACAATGCGAGAATTACGTACCATGTGGGAGAATTTACGTAGACGAATCATAATGTTACACAGGGAAACCGGCCGCCAAGACTTACCTAACTATTACACGAGATCTCTTTTAGAAAGATGTATGATGTTGACTGAAATGACAGGAGAAGCGAGTCGTAATGCCACGAGAAATTATAGATCGCAATCTAATATGTCTTCTTATGAGCCCAGGCCTAGCACGTCTAGAGCTGTTCCGGAAGACAATAGACCCTCACTATCAGGAAGATCTCCATCAAAAAAAACACCCCAACGATCCTTATCGTCATTATTATCAAAGAGACTTCAAGCAGCCTATTGTAGATGGCGGCCAGAAGACCGAAGACGCAATCCGATTTCAAGTCGTATGTCTCAACGATATCTACATAGACCACGAAGGGAATATACAAGGTTAATAGAAATAAGTGCGACAAGGCATGAAATTCGCATGCGTGCTATGCAAGTGTTGTCCGTCATGTTTAACATGATGATGTTGTGTTTAGAGGAGCGAGGATTGAGTTCGCTGATCATAAATATGTTACGCACTCTAAAAAAGGCGCTGGCATTGACATGTTTGATGCTAATGACAAATAGGTACAACCCACGCTCCAATAATAACGATTCTCCGCAGAGAGTTGATTCAATGAATGTTATACGTTTGCAAAATGTTGATCACACCGGTCCCGTTAATGTGGACGGTCCAGATGAATCTCCGCATCATTCAATATCCAGCGAAACGGAAGAGGATCGATCGACTACACAATCTGTAGCTACAGAAAATCCTGCAACAGATGACACAACACCACCGACGGCACCCGCTAGCCAAGAACCAACCCAAAGATGGAGCAATCGGTTAGCAGTACAGATTGCGGCTGCCAATCGAAATACTTCAATGACCGCTAGAACGAGACGAGATTTGTACGTAGAAAGCAAGAGACAAAAAGCACTACATAGAACAAATCCGTTAGCCCATCCGCTAATTAAGAAAAAGGTGCTACCACCCGTATCCACATACAGAATTCCTTCGTTGCGTTTGACACCTCGAAGATTAAGTGTCAGGAATAGCGCGTCCAGGGCCAATCCAGCAAATGGAGAACCAATAGCTGGGCCGTCTGGCATTCTTTCTGGCACCGGTACCGTTCGTGGCGGACCGTCAAGGCTCCCTCCGGAAATGAGTAATGAGTTTGAACACAGAATCAATTTAATAAGAATGGCGCACATGCAAGCTGTAAGACTACGTAATGCTGCTAGAAGTAGATTTCGCCGTTTGCAAACCATCCGTCTGTACACCCCATCTTCAGTGCGAGAGATGTTCACCCTGCAGCCTGAGGGTGACAATCCCCACGAAAGCCGACCATTACAAAATAATTCTGAACCCAACCGGCGATCGCTCACGTCCTACGACTACAGGCCTCATATACTGACTCGGGAGAGAATCTTCTCTAGAGGAGCTGCTAATAGACCTTCAGAAGATTCAGGACAATTTCACGCGGTCATCAGCAACACGGGTATGCCACTGATGCAGGTCAGTGACCTCAGCATCAGCAACCAAGATGGTCAGGGCAACCAGAACCAACGGCTACCGAGAATCCATGAATACTTGCAACCGATTATTTTGGCCCAAAACGCAATGGTGGTTGATGAGGAGAGGGGTGAGGATGGGCCTGGAGGTCCGGGAGGGTCGGGGGGAGCTACAAAAAACGTGGTGGTTCAACGTTGCCGTATCCACAATGACGCGAGCATCGACATATCCAAAGACGGCAGACTGTTGGTGGCTCTCCTACCAGTACCGCGGCTCAGGAACGCGAACCATTGGCTCGGTGTTTATTCCTTGGAGTGGTCCCGTCTGGGTCAATGTCTCCACACAGCGGTGTTGGAACAGAATGCTGTTTCAGTGGCACTGTCACCAACAGCAAGACATCTGGCTGTAGGTCTTGGATCTAGAAGATTCACATCAGCAGCTCACAGTAGGAACAATGTGTTTGCACTGCTGTACAGATTAGATCCACTTGAGAATTCAAGCCGCACTGGTTTATCACCTATCAAAGAATTGGAACAGACATGGGAGCATGGCTTCACCAGCCTGAATTGTCTCCGTTGGGCCCCGCAACCCGGCCAAGGACTGGTATATGCCAATAATACAGGACAACTGATAATTATGAGCTAA

Protein sequence:

>DPOGS211108-PA
MNVPRILKGNNLAFYRKPFEIGIFKRFLTFLILHLSQFKKVMDPVIDNEWLGCKNDPIDEPPTFGNIARSWQWRERGVNAPNSPSNKTVLENVAEDVLVQKPLKIRGEFPLINGNCLVFGRLVTTACATWAKLASGKHVRILKGHPRTPWCIAFHPSHPQLIGSGCLAGQVRVWDISSGGSEVWNVRNETVIASIAFHPRVQLLVIATYNELYFWDWSQPAPMTKVSTNNINEKVRYVAFDSLGYKLITGISLSAIRGGVDATVLQHAPSNPQNNSNRSLHPDDNDGSNRPDTAGATQDVIVNSYQYLVQRYDSLVRNYQRLFMVRNRLAVTPPPNTTDRGTDPMETDPAPEANSYTRTARSYTATTQLDDENQNSNTSNSNTAEDTADAAEEGSSGLLNLDALSFDRAPPGERNMSRLELAIFGARPSAFHPLTNENPGRSFRTVDQNERASRLLPNPFSNTNGTPRPSRVSFASSPLTQQSNNDNAPNASRTSIFTSYRPNSPSVLDLSSRRWSGNRRVFIPRPSDSQPSNSNSSTRSRQEDASSNTTNRNTATNTSASSNSNSIPNISLSNNSLSTGNTPTDTTPSTNNSTRRTSSPPQGSPVDAQTINESLDLIRDILRDSGTRLLNLITNMSASSLPHVGPEVPNRRVNIRAHSDDSGESERPRGARPRVRLFSSNLQPEILSSSSESDSDQSSEVNILRTPNTTFREEDSREESSRTTEMPGPSNRSAPPSDRADSENDDNTPNTGIDTESVSTAASASSSRNRPRTQNNDAGEGPSTSSHASDEPENLAQNSSNYFDSSRPSTSGNNIWTNGATVPPEEAFRKVRGGVNALQKHTTQLTNMWVRGNRTTMRELRTMWENLRRRIIMLHRETGRQDLPNYYTRSLLERCMMLTEMTGEASRNATRNYRSQSNMSSYEPRPSTSRAVPEDNRPSLSGRSPSKKTPQRSLSSLLSKRLQAAYCRWRPEDRRRNPISSRMSQRYLHRPRREYTRLIEISATRHEIRMRAMQVLSVMFNMMMLCLEERGLSSLIINMLRTLKKALALTCLMLMTNRYNPRSNNNDSPQRVDSMNVIRLQNVDHTGPVNVDGPDESPHHSISSETEEDRSTTQSVATENPATDDTTPPTAPASQEPTQRWSNRLAVQIAAANRNTSMTARTRRDLYVESKRQKALHRTNPLAHPLIKKKVLPPVSTYRIPSLRLTPRRLSVRNSASRANPANGEPIAGPSGILSGTGTVRGGPSRLPPEMSNEFEHRINLIRMAHMQAVRLRNAARSRFRRLQTIRLYTPSSVREMFTLQPEGDNPHESRPLQNNSEPNRRSLTSYDYRPHILTRERIFSRGAANRPSEDSGQFHAVISNTGMPLMQVSDLSISNQDGQGNQNQRLPRIHEYLQPIILAQNAMVVDEERGEDGPGGPGGSGGATKNVVVQRCRIHNDASIDISKDGRLLVALLPVPRLRNANHWLGVYSLEWSRLGQCLHTAVLEQNAVSVALSPTARHLAVGLGSRRFTSAAHSRNNVFALLYRLDPLENSSRTGLSPIKELEQTWEHGFTSLNCLRWAPQPGQGLVYANNTGQLIIMS-