Monarch geneset OGS2.0

DPOGS207626
Transcript        DPOGS207626-TA  2838 bp
Protein           DPOGS207626-PA  945 aa
Genomic position  DPSCF300199 - 117863-122860
RNAseq coverage   149x (Rank: top 53%)
Annotation
Heliconius       HMEL011987        0.0    82.88%
Bombyx           BGIBMGA006009-TA  0.0    79.83%
Drosophila       CG7158-PA         2e-67  31.49%
EBI UniRef50     UniRef50_E2AZ33   0.0    53.08%  Alsin n=5 Tax=Formicidae RepID=E2AZ33_CAMFO
NCBI RefSeq      XP_396645.3       0.0    48.69%  PREDICTED: similar to Alsin (Amyotrophic lateral sclerosis protein 2) [Apis mellifera]
NCBI nr blastp   gi|307167933      0.0    53.08%  Alsin [Camponotus floridanus]
NCBI nr blastx   gi|307167933      0.0    52.99%  Alsin [Camponotus floridanus]
Group
Gene Ontology    GO:0005515  1.1e-05  protein binding
KEGG pathway     ame:413195  0.0
                 K04575 (ALS2) maps-> Amyotrophic lateral sclerosis (ALS)
InterPro domain  [837-936] IPR003123  2.2e-17  Vacuolar sorting protein 9
                 [382-404] IPR003409  2.8e-06  MORN motif
Orthology group  MCL15025  Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207626-TA
ATGTTATTGATATATCATTCTTTGGTTAAGCCTTTCCTTAAGAAAAGCAAATCTATTACATTACATTCCAACGTGTACGAAACTGTTTGTGATATATTCGGTGACATATTATATATAACGGCACTGAATGTGCTATCAACATGGCAATATGCTGAGAAGCACATAAGTGAATGTGATATTTCTTTAATAAAAAATCTAGAAGAGTTTATTTTTTTATATAAAAAGTATTATTTAGCTGTAAGCAATTTGATTGTTATTGGTGGTTTCGCTCACATAAATAATATTGTGGATGTGCCGTCAAGTATTTATAATTTATTTAGTGACCAGCTACCAACGAATAAACAAAAATATAATAAGAAGACATTAGAAGTAGTTGTAGCGTTAGCCTTTGTTCAGCCATTAACGAGATTAAGCTCTTACAAATGTATTGCTCAATCTTTGATACGCCATAAATTGAAGAGAAAAAAACATGATTCTAAAACGAGTATTGAGGAAAAAATCAATAAAGTGATATCTTCCTTCGACGGTTTAATAGAAGAACAAGAAAAGAAACGAAAAGAAGCAGAAGGTACAAAGCTGTTCTGGGAAACCCTTGGAAAAAGTTTAGATCAATATAGAACACCCGAAAGACGTCTCATAAGAGATTCAAAAAGCCGGCCCCTTAATCTTGTTAATCCTGGCAGGTTTAGCAGCCATTGGTTCATTTTGTTCAATGATTTGTTCGTTCACGTAAACGGGAGTACAGCAAACATCCATCCATTAGAAACGCTATGGATTGAAGCTGTGCCAAACACAGATTCATTGCAAAACGTCATACAGTTAACAACCCCGGAGGAGGTTATATGCGTTTCGACAATATCTGAACAAGAAAAAACAGAATGGATACATAGCCTGAATGCATCCATTCGAACGGCCTTAAATAAAGAATCTGCTTTCAAACCTCCGTCGATAAGAACAGCTAGTTATACATTTTCTTCTAAGAATATTTTTTATAAAGACGCTAAATATCACGGGAGATGGCTTGACGGGAAAATACATGGCAATGGAAAAATCGAATGGCCAGATGGAAAATTTTACGTGGGTCAGGTACAGTTGAACGCTTTATGTGGACATGGGAAAATGGAAATTCCTGGAGTTGGTTTATACGAAGGTCAGTGGAAAGATAATCTTCAAAATGGTTACGGTGTCCTGAAGTACACTACAGGTGATATGTATGAAGGATATTTCAAAGATGGTCTTCCTCAAGGACATGGTATCAAAAAACAAGGTGATTTCACGTCATCGACCGCAACTATTTATACTGGAGAATGGGTGAATGGTGTCAGGCAGGGCTACGGAGTTATGGACGATATTGGTAAAGGGGAAAAGTATCTCGGCAACTGGAGTGATAACAAGAAACACGGATGCGGTCTAATTGTTACGTTGGACGGAATTTATTACGAGGGTCTCTTTACAGCGGACGTGTTGACGGGTCATGGTGTAATGGTATTCGAGGATGGTACACATTACGAGGGTGAATTTCGATCAGCTGGGATTTTCTCTGGTAAGGGAGTTCTCACATTCTCTAGTGGAGATCGAGTGGAGGGTTCTCTCAGCGGTGCTTGGACTGAAGGCGTTAAAATATCCAACGCTGTAATGCATTTAAACGTCTCCAATCCAGTTCTGCCTCCTAACGCTAAGCCTGGGTCGTTCGGCAAATTAAGCGTAGCTGCGAATCAGAAATGGAATGCTTTATTTCGTCAATGCTATCAGAGCATGGGTGTGTCAGAAGCGATTCTCACACCGAGCGGTAATCATCTAGCAAACGGATCCTCGAATGTCCCAGACAACGTCAAAATATGGCAAAACGTAGCCGTCGCTATATCCAGTTCTCACAAACATTCGAAACCGGGAAAAAAGACGAGCGGATCTGATGTCGAGAAATCGGTGGACTGCCTGGAAGTTATACCAGTTTTCGGCCAGAAGACACTAACGTTAACCGACTTCACAGAACTTAAGCA
GTATTTATACAATGCCTGCGAGTCCAATCATCATCCGCTGGGTCAATTAGTACTGAGTGTAACAAATGCCTTCAATACCACGTATGGCGGTGTGAGAGTTCATCCGCTCTTACTAAGTCATGCTGTCAAAGAGCTTAAAAGCATCACGTCAAGACTGTACCAAGTGGTGACACTCCTTTTCCCCGCTTTACCATACGAAGGAACTGTTGTGGTGTTGCCAGACACCTCCGATGGAAGCTGTCACGAAAACAGTAATCCAATTGATGGGGAGGTGGTGTCGAGCAATTCCTTACTGCAGCCTGTACTGTTGCCAAGAGTTCACCCCGCCCTGTTCGTGTTATACGCGTTACATAACAAAAGAGAGGACGATTTATATTGGCGAAGACTGCTCAAGTGGAACAGGCAGCCGGACACCACGCTGATGGCTTTCCTTGGGATAGATCAAAAATTCTGGACCGTCCATCGTAACTCACCGTCGCCAATGCTCTCGCCGTCAAAGGAAGAAGTATTTCAAGAGGCTGTTGAAACTCTCCAGCAATTGAAGACCACATTTTCTCCGATAGAGAAACTCCTCGTGATCCGAAGCACTTTCCAAAAGATGACAACAGCGGTGCAACACGAATTAGGGCAACATTATCTGTGGAATATGGACGAACTGTTCCCTGTATTTCACTTCGTCGTTGTGAGAGCTCGTATTCTTCAGCTTGGTTCAGAGATACACTTCGTGGAGGACTTCCTGGAGCCCGGGACTCAGTGCGGTGAACTAGGACTCATGTTTACTACACTGAAGGCATGCTACTTCCAAATTCTACAAGAGAAAATGTCGATCAACACCTAA

Protein sequence:

>DPOGS207626-PA
MLLIYHSLVKPFLKKSKSITLHSNVYETVCDIFGDILYITALNVLSTWQYAEKHISECDISLIKNLEEFIFLYKKYYLAVSNLIVIGGFAHINNIVDVPSSIYNLFSDQLPTNKQKYNKKTLEVVVALAFVQPLTRLSSYKCIAQSLIRHKLKRKKHDSKTSIEEKINKVISSFDGLIEEQEKKRKEAEGTKLFWETLGKSLDQYRTPERRLIRDSKSRPLNLVNPGRFSSHWFILFNDLFVHVNGSTANIHPLETLWIEAVPNTDSLQNVIQLTTPEEVICVSTISEQEKTEWIHSLNASIRTALNKESAFKPPSIRTASYTFSSKNIFYKDAKYHGRWLDGKIHGNGKIEWPDGKFYVGQVQLNALCGHGKMEIPGVGLYEGQWKDNLQNGYGVLKYTTGDMYEGYFKDGLPQGHGIKKQGDFTSSTATIYTGEWVNGVRQGYGVMDDIGKGEKYLGNWSDNKKHGCGLIVTLDGIYYEGLFTADVLTGHGVMVFEDGTHYEGEFRSAGIFSGKGVLTFSSGDRVEGSLSGAWTEGVKISNAVMHLNVSNPVLPPNAKPGSFGKLSVAANQKWNALFRQCYQSMGVSEAILTPSGNHLANGSSNVPDNVKIWQNVAVAISSSHKHSKPGKKTSGSDVEKSVDCLEVIPVFGQKTLTLTDFTELKQYLYNACESNHHPLGQLVLSVTNAFNTTYGGVRVHPLLLSHAVKELKSITSRLYQVVTLLFPALPYEGTVVVLPDTSDGSCHENSNPIDGEVVSSNSLLQPVLLPRVHPALFVLYALHNKREDDLYWRRLLKWNRQPDTTLMAFLGIDQKFWTVHRNSPSPMLSPSKEEVFQEAVETLQQLKTTFSPIEKLLVIRSTFQKMTTAVQHELGQHYLWNMDELFPVFHFVVVRARILQLGSEIHFVEDFLEPGTQCGELGLMFTTLKACYFQILQEKMSINT-
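
The lengths reported for this record agree with each other: 945 residues plus one stop codon is 946 codons, or 2838 bp. The minimal Python sketch below shows how that check could be run on the two FASTA records. It assumes the transcript record is coding sequence only (no UTRs) and that a trailing '-' or '*' in the protein marks the stop codon; neither is stated on this page. The names `read_fasta` and `cds_length_matches` are hypothetical helpers, not part of any database tooling.

```python
def read_fasta(text):
    """Parse FASTA text into {header: sequence}, joining wrapped sequence lines."""
    records = {}
    header = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            header = line[1:]
            records[header] = ""
        elif header is not None:
            records[header] += line
    return records


def cds_length_matches(nucleotides, protein):
    """Return True if the nucleotide length equals 3 * (residues + 1 stop codon).

    Assumption: a trailing '-' or '*' on the protein is a stop marker,
    not a residue, so it is stripped before counting.
    """
    residues = protein.rstrip("-*")
    return len(nucleotides) == 3 * (len(residues) + 1)


# Toy example (not the real sequences): two codons plus a stop codon.
toy = read_fasta(">toy-TA\nATGAAA\nTAA\n>toy-PA\nMK-\n")
print(cds_length_matches(toy["toy-TA"], toy["toy-PA"]))
```

Because `read_fasta` joins wrapped lines, a nucleotide sequence split over several lines, as the one above is, would be read as a single record.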