Monarch geneset OGS2.0

DPOGS201278
TranscriptDPOGS201278-TA2991 bp
ProteinDPOGS201278-PA996 aa
Genomic positionDPSCF300176 - 862202-871096
RNAseq coverage81x (Rank: top 64%)
Annotation
HeliconiusHMEL0123820.064.49% 
BombyxBGIBMGA003029-TA0.062.28% 
Drosophilarg-PD6e-7941.24% 
EBI UniRef50UniRef50_E2BDN33e-15945.00%Protein FAN n=1 Tax=Harpegnathos saltator RepID=E2BDN3_HARSA
NCBI RefSeqXP_971631.15e-16045.54%PREDICTED: similar to neutral sphingomyelinase (n-smase) activation associated factor fan [Tribolium castaneum]
NCBI nr blastpgi|2700070721e-16045.47%hypothetical protein TcasGA2_TC013522 [Tribolium castaneum]
NCBI nr blastxgi|3320284572e-15744.87%Protein FAN [Acromyrmex echinatior]
Group
Gene OntologyGO:00055151.6e-35protein binding
KEGG pathway 
InterPro domain[445-725] IPR0004094.5e-120BEACH domain
[738-993] IPR0110461.6e-35WD40 repeat-like-containing domain
[732-993] IPR0159431.5e-29WD40/YVTN repeat-like-containing domain
[166-259] IPR0233625.2e-14PH-BEACH domain
Orthology groupMCL16183 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201278-TA
ATGAACAAATCCAGATTCTCCATGTTATTGTTGGAGCCAGGAGAAATATATTTTGAGGATTATTCCTGTACAATGAGTGATATATCAAACTTATCTAATACAATACAGGGTAGATTAAAACTTTGTTCAAAGTCTCTTATTTTTGAGCCAAGAGAGTGGATTTATCCACTTATTAAATTATATTTTAAAGAATGTATTGATATTAATGTAAAAACAGATGGTGGTGATGAGAAAAGTGCTTTAAATATAAGAATTAAGCAATACGCTGAGATGTTAGAAGAGAATATGCTAGCACCATATAAATTTAAATACGAATGTAAAGAGTTCCGTTTTAATTTTGATTTTGTGTCTGCTGAGGAATGTTTGAGCCAGATGCAACAATTACACAGAGCGTCAACTCTCCATGCTCCCGAACATAACAGTATGGTGGCAACTATCCAACATTCTCGTTATACGAGAATGACATTTGATCCCATTATGATGGATGATTTTACAGAAAAAATTATTGAGGAATTTCAAGCTGAAAAAATTTCACCATTAGTAAGACACCAAGGCAAATTAGCACTCACTTCTACAACCTTATATTTCCAACCGTTCAGCAATATTGAAAGTAATGCGATTCTAAAAATGAAATTAAGCGAGCTGAACAGGATATATAAAAGGAGATTTTTATTAAGGCAAGTTGGCCTTGAAATATATGGTAAAGAAGGTACTGCTGTGTCTCACATATATCTTGTATTCCAATGTGAGGATGACAGAGATCTGGCATATAAAACTCTAGAAGAATCTCCAAATGTACACTTGGAGCCCGTTCATGTCGAAGAAATGACACTCCAATGGCAGAATGGAATTGTATCTAATTATGATTATCTGATGTATCTCAACTGTCTCGCTGACAGGAGTACAAAGGACTTAACACAATACCCTGTATTTCCTTGGGTGGTGGCAGATTATACATCAGAAAAATTAGATTTAGATAATGCAGACACATTCAGAGATCTCACTAAACCGATGGGCGCTTTGAATCCTGATAGATTAGAGAAATTACTAGAAAGATTCCATGAAATGTCAGACCCGAAATTCTTATATGGATCTCACTATTCAGCTCCCGGCCTTGTGTTGTTTTATTTGAATGCGATTCTAAAAATGAAATTAAGCGAGCTGAACAGGATATATAAAAGGAGATTTTTATTAAGGCAAGTTGGCCTTGAAATATATGGTAAAGAAGGTACTGCTGTGTCTCACATATATCTTGTATTCCAATGTGAGGATGACAGAGATCTGGCATATAAAACTCTAGAAGAATCTCCAAATGTACACTTGGAGCCCGTTCATGTCGAAGAAATGACACTCCAATGGCAGAATGGAATTGTATCTAATTATGATTATCTGATGTATCTCAACTGTCTCGCTGACAGGAGTACAAAGGACTTAACACAATACCCTGTATTTCCTTGGGTGGTGGCAGATTATACATCAGAAAAATTAGATTTAGATAATGCAGACACATTCAGAGATCTCACTAAACCGATGGGCGCTTTGAATCCTGATAGATTAGAGAAATTACTAGAAAGATTCCATGAAATGTCAGACCCGAAATTCTTATATGGATCTCACTATTCAGCTCCCGGCCTTGTGTTGTTTTATTTGGTAAGGAAATATCCACAGTACATGTTGTGTTTGCAAAACGGAAGATTTGATCATCCGGACAGAATGTTCAATTCAGTGAAGGACGTTTATAATAACTGTTTGAAGAATATGTCGGATTTTAAGGAGCTCGTGGCAGAGTTCTACGACACAAGCACAAAGGGAGATTTCCTAGTTAATATTCACGATATAGATTTCGGGGAGCGCCACGACGGAAACAAGGTCGCGGACGTGGCCTTACCCCCGTGGGCGGACACCCCAGAACAGTTCGTACACAAACTAAGACAGGCCCTCGAATGCGATTACGTGTCCAGATACTTACACGCCTGGATTGATCTCATATTTGGTTACAAGCAACGCGGCGAGGAGGCTGTCAAAGCTAATAATGTATTCCACCACGTGTGCTACGAGGGCTCTATAGACTTGGACGTGATATACGACATGAACGACAGACACGCGCTCGAAGTACAGATCATGGAGTTCGGCCAGATACCGAAACAGCTGTTCACTAAGCCTCATGTGAAGAGAATACCACAGCAGATAATAAAGCCGCTGAACACGAAGCCGACCAGTCATATAGAGTGTATTAAAACAATAGCGTTACACAAGGAGGCCGTGACCTGTGTGATGAGAGAGAGAGACAGAATCATATCGGTCGGTAAAGACGGTACTTTGAAAGTGTACGACGTTGTCCAGGATAAGCAGATACGCAGCGTCATACTGTGCGGTACGCCGCTGAGTTCCTGTGTTATGGTGGACGAGCATATAGTAGCTGCTGAGCGCGTGTTGTTGTCTGGTGGGTGGGACGCCGTAGTACGTGCTTGGAGGCGTGCGGGCGCTGGGCTAGCGCTCCGCGGACTGAGGGCGGAGCTCGACCACGACGGACGAGTCACCGCCTTGGCTGTCAGGTACAGGAAACACTACTTGGACATATTATCTGGGACCAGCGACGGCGACGTCTACCTCTGGGATTACTCCACCAAGGAGTTGATTAATAAGATCACGGTTCATCCCTCGCCGGTCGCCGGCATATGTCTGTTGTCAGAGGAGAGAATCGTCAGCGCCAGCGAGAACGGTGACTTGAGCGTCACCGATCTGAAAGTTTTGAGTTCGGTGTACGAGAAGCAGGTGTCGTCAACCCTGAGGGCGGTGTGCTCGGAAGGTGACCTGCTGTGGGCGGGCGGGGCGGGGTTGCTGCTGCAAATGGACATGCTCGCTGTAAAGCAGCTGGGGGAGTATGACGCTCATCAAGATCTGATCAGGTCCATTCACTATGATGCAGCCTCGAAAACCCTCACAACGGCTAGTGATGATAAGTCAATTAAAGTCTGGATAATAACATGA

Protein sequence:

>DPOGS201278-PA
MNKSRFSMLLLEPGEIYFEDYSCTMSDISNLSNTIQGRLKLCSKSLIFEPREWIYPLIKLYFKECIDINVKTDGGDEKSALNIRIKQYAEMLEENMLAPYKFKYECKEFRFNFDFVSAEECLSQMQQLHRASTLHAPEHNSMVATIQHSRYTRMTFDPIMMDDFTEKIIEEFQAEKISPLVRHQGKLALTSTTLYFQPFSNIESNAILKMKLSELNRIYKRRFLLRQVGLEIYGKEGTAVSHIYLVFQCEDDRDLAYKTLEESPNVHLEPVHVEEMTLQWQNGIVSNYDYLMYLNCLADRSTKDLTQYPVFPWVVADYTSEKLDLDNADTFRDLTKPMGALNPDRLEKLLERFHEMSDPKFLYGSHYSAPGLVLFYLNAILKMKLSELNRIYKRRFLLRQVGLEIYGKEGTAVSHIYLVFQCEDDRDLAYKTLEESPNVHLEPVHVEEMTLQWQNGIVSNYDYLMYLNCLADRSTKDLTQYPVFPWVVADYTSEKLDLDNADTFRDLTKPMGALNPDRLEKLLERFHEMSDPKFLYGSHYSAPGLVLFYLVRKYPQYMLCLQNGRFDHPDRMFNSVKDVYNNCLKNMSDFKELVAEFYDTSTKGDFLVNIHDIDFGERHDGNKVADVALPPWADTPEQFVHKLRQALECDYVSRYLHAWIDLIFGYKQRGEEAVKANNVFHHVCYEGSIDLDVIYDMNDRHALEVQIMEFGQIPKQLFTKPHVKRIPQQIIKPLNTKPTSHIECIKTIALHKEAVTCVMRERDRIISVGKDGTLKVYDVVQDKQIRSVILCGTPLSSCVMVDEHIVAAERVLLSGGWDAVVRAWRRAGAGLALRGLRAELDHDGRVTALAVRYRKHYLDILSGTSDGDVYLWDYSTKELINKITVHPSPVAGICLLSEERIVSASENGDLSVTDLKVLSSVYEKQVSSTLRAVCSEGDLLWAGGAGLLLQMDMLAVKQLGEYDAHQDLIRSIHYDAASKTLTTASDDKSIKVWIIT-