Monarch geneset OGS2.0

DPOGS211089
Transcript        DPOGS211089-TA  2991 bp
Protein           DPOGS211089-PA  996 aa
Genomic position  DPSCF300007 - 1139907-1155997
RNAseq coverage   1947x (Rank: top 6%)
Annotation
Heliconius      HMEL012484        0.0     77.77%
Bombyx          BGIBMGA002969-TA  0.0     65.53%
Drosophila      jp-PD             2e-153  63.10%
EBI UniRef50    UniRef50_Q16R13   0.0     73.44%  Putative uncharacterized protein n=1 Tax=Aedes aegypti RepID=Q16R13_AEDAE
NCBI RefSeq     XP_001661427.1    0.0     73.44%  hypothetical protein AaeL_AAEL011094 [Aedes aegypti]
NCBI nr blastp  gi|380026967      0.0     51.14%  PREDICTED: uncharacterized protein LOC100866746 [Apis florea]
NCBI nr blastx  gi|383864691      0.0     51.26%  PREDICTED: uncharacterized protein LOC100876850 [Megachile rotundata]
Group
KEGG pathway     ame:413195  3e-15
                 K04575 (ALS2) maps-> Amyotrophic lateral sclerosis (ALS)
InterPro domain  [126-148] IPR003409  5.4e-06  MORN motif
Orthology group  MCL11741 Multiple-copy universal gene

Nucleotide sequence:

>DPOGS211089-TA
ATGCAGCCGAGCGAGCCAGCCGACACCGCGCACGCCGCTAGCGGAAACCCGCAGCGTGGCCTCAACGGTGGCCGATTCGACTTTGACGATGGTGGTACATACTGTGGCGGTTGGGAAGACGGCAAGGCCCACGGCCACGGCGTCTGCACCGGACCGAAGGGTCAGGGAGCGTACGCCGGCTCGTGGCACTTCGGATTCGAAGTATCCGGCGTGTACACCTGGCCCAGTGGTAGTTCATTTGAGGGGCAATGGCAGAACGGAAAGCGACACGGTCTGGGCGTGGAGACACGAGACCGATGGTTGTACCGTGGAGAATGGACACAAGGCTACAAGGGTCGCTACGGCGTGCGGCAAAGCAGCACCAGCAACGCCAAGTACGAAGGCACCTGGGCCAACGGCCTCCAAGACGGATACGGTTCAGAAACATATGCGGACGGCGGAACTTATCAAGGACAATGGATGCGGGGCCTTCGCCATGGATACGGAGTACGGACGTCAGCGCCTTTTGGGCTGGCTTCTCACTACCGCGGTGGTGGGCACCATCGGGGCTCGCTATCTTCTTTGGCGGAGGCTACTGGAACACCTGATCCTTCAGATCGGCGTACCACGCGCATGGATGAAGCAAGGGGAGGCTTCGTCCTTAAGGCCAGCTCGGATGAACCTTCAGGGAGACGAGGATCACTCGTCGAAAAAACTAAAAAGGGACTGCTTGCTAAACTTCGCAAACAGCGCAGTGCGGGTGAACTGGACAAACGCGGTACGGGCTCGGTGCGATCAGGAGGCTCCGGCGGATCTGCATCATCATGGGTGTCTTCAGTCGAGAGCACACATTCTGCGACTACACGAGGATCACTGCACACCAACTCGAATACCAGTTTCATTGTCGAGGACGAGCATCTTGACGCAAGTGTGACAGAATCATATATGGGCGAATGGAAGAATGATAAACGGACGGGTTTCGGTGTTAGTGAACGGAGTGACGGTCTCCGGTACGAGGGCGAGTGGTTTGCGAACCGAAAATACGGCTACGGTGTGACAACGTTTCGAGATGGGACCAGAGAGGAGGGCAAATACAAAAACAACGTTCTTATAACCAGCCAAAAACGCAAACATATGTTTCTCATGCGTTCGGCTAAATTTCGAGAACGAGTTGATTCAGCGGTAAATGCAGCTCAACGTGCTTCAAAAATAGCACTACAAAAAGCGGACATTGCTATCTCTAGGACCGCTACAGCGCGAGGCAAAGCAGAACAAGCCGATGAATCGGCAGACCAAGCTAAAGAAGACTGTGATATAGCACAATCAACTGCCAAGCAGTTCGCACCAGATTTTAAGCATCCTGGTTTTGATCGTATAGGATTAAGAGAAAAGTATAGGCAAAAATCTTTCGAACCACAGGTCACTGCTGCTGTTTCACAAGAATCTGATAAAATACAAGAAGGAAAGAGCATTCCAAATCATATCCCTCCGATGCATTCACAGATTCCACAAATGAATAAAATGCCAAATGCTATGTCTTCTAGAAGACCATCTGCCCAATATCCAAATTCTGGCCATCAAGTCGATTCCAGAATTCAAAATAACCCCAATACTTACAATCCTAATGAGAGAAATCTTGGTCCTTCTTACGATCAAATGTATAAACCTAATCAAGACTCTTGGACGAATCCGAGCATATCACAAAATCAATTATCGAATCAAAAATCATATGGACCTATCGATCCAACTAGTGGACATAGTGTTTCACCTCCAGACAGTTATTCCCAACAACTACACCGATTACAACAGCAACAAAGCATAGACCAAAGCAACCAACAATATGTAAACCAAAATCGAAGATTTTCATCTTCTGTCCGTCCACCTAACACACAACCGTCATCACAGGAATGGAACGCTTCATCAGGGTTGACTAGACGACAAAGCATACTTGCACAACCAACCCATGATCCTCAAGGTAACATCTATGTCGATAGCGGTACTGCTTCTTACGGTAGGCACGA
ACTTCGAACGGGAACCATTCATTCTGAGAGCGCTTTAAATACGGAATCGCAAATGTACCGTCAAACCGGAGAAAACCAAGGTTACCGACAACAAATCGATCAGCAAATGTATAAGCAAGAACCTGACCAACAAATGTACCGTCAACTACCAGCAGAGCAATCAGGTTACAGACAAACACCCGACCGACAAATGTACCGGCAACAAGCCGTTGATCAACCCTTATATCGTCAAGCTGGTCCAGCTGAACAAGAAGGAATGAATGGTGATCGCGACGGTCGACAAGGGCCTCGGACATCAATTGATTATTTCGACCACTACAAACGGCCGCCCAGCAGAGACTCCAGTGTGGATCGTTATGGCAGACGATCAAGACAGCCGTCAGTTGAAGCCGTTCCTCCTGGTAGTGGATCGCGAGCCGGATCTGTGGCACCACAACCACCCCCTCTATCACGACCTGCTTCTCGAGCGGCAACTCCAGCCGGCAATGGACATTTAGCGTCTGGCCGCGGTTCCATTTCACGAGCATCATCACGAGAGCCTCAACCTTTCGAGGACTCACTTTTGCGAAAGCGAAATCTTGGACAGGACATATCTCCTTCGCCCTATCAACCAAAACGAACAGAAAGTTTATATGTCACACAAAATCCAGCGCCGCCCCCCGCTGTGCCAATGGGTGGAGGTGGCGGTGGAAGAAAGATACTGAGCACTCCTCAGACCCTACAACGCAAGAAGTCACTCCCGGACGTTGCTGCTATGCCGCGGCTGCCTGACGGTGGTGGCATGTCTCGGGAAGAAGTCTCCGCCCTGGGCTCCGCTCGTAGAGAGGAAGTCCGACGCATGCACGAAGAGACAGAGAAACTGAGAGCCAACCCTCTCCTCTACTTAGTCAGTCCGCAAGTTAAGGACTGGTTTTCGCGTCAGCAGTTAGTAATTCTGGTATTGTTCATAAACATCTCGTTAGGGATCATGTTCTTTAAACTGCTGACGTAG

Protein sequence:

>DPOGS211089-PA
MQPSEPADTAHAASGNPQRGLNGGRFDFDDGGTYCGGWEDGKAHGHGVCTGPKGQGAYAGSWHFGFEVSGVYTWPSGSSFEGQWQNGKRHGLGVETRDRWLYRGEWTQGYKGRYGVRQSSTSNAKYEGTWANGLQDGYGSETYADGGTYQGQWMRGLRHGYGVRTSAPFGLASHYRGGGHHRGSLSSLAEATGTPDPSDRRTTRMDEARGGFVLKASSDEPSGRRGSLVEKTKKGLLAKLRKQRSAGELDKRGTGSVRSGGSGGSASSWVSSVESTHSATTRGSLHTNSNTSFIVEDEHLDASVTESYMGEWKNDKRTGFGVSERSDGLRYEGEWFANRKYGYGVTTFRDGTREEGKYKNNVLITSQKRKHMFLMRSAKFRERVDSAVNAAQRASKIALQKADIAISRTATARGKAEQADESADQAKEDCDIAQSTAKQFAPDFKHPGFDRIGLREKYRQKSFEPQVTAAVSQESDKIQEGKSIPNHIPPMHSQIPQMNKMPNAMSSRRPSAQYPNSGHQVDSRIQNNPNTYNPNERNLGPSYDQMYKPNQDSWTNPSISQNQLSNQKSYGPIDPTSGHSVSPPDSYSQQLHRLQQQQSIDQSNQQYVNQNRRFSSSVRPPNTQPSSQEWNASSGLTRRQSILAQPTHDPQGNIYVDSGTASYGRHELRTGTIHSESALNTESQMYRQTGENQGYRQQIDQQMYKQEPDQQMYRQLPAEQSGYRQTPDRQMYRQQAVDQPLYRQAGPAEQEGMNGDRDGRQGPRTSIDYFDHYKRPPSRDSSVDRYGRRSRQPSVEAVPPGSGSRAGSVAPQPPPLSRPASRAATPAGNGHLASGRGSISRASSREPQPFEDSLLRKRNLGQDISPSPYQPKRTESLYVTQNPAPPPAVPMGGGGGGRKILSTPQTLQRKKSLPDVAAMPRLPDGGGMSREEVSALGSARREEVRRMHEETEKLRANPLLYLVSPQVKDWFSRQQLVILVLFINISLGIMFFKLLT-
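
Consistency check:

The transcript and protein records above can be cross-checked against each other. The following is a minimal sketch, not part of the original record, assuming the standard genetic code (NCBI translation table 1). The function name `translate` and the short test fragments are illustrative; the fragments are copied from the start and end of DPOGS211089-TA. The code writes a stop codon as `*`, whereas the protein record above marks it with a trailing `-`.

```python
# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0; a stop codon becomes '*'."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# Lengths reported in the record: 996 residues plus one stop codon.
transcript_bp = 2991
protein_aa = 996
lengths_agree = transcript_bp == 3 * (protein_aa + 1)

# First 30 and last 9 nucleotides of DPOGS211089-TA.
first_residues = translate("ATGCAGCCGAGCGAGCCAGCCGACACCGCG")
last_residues = translate("CTGACGTAG")
```

Passing the full nucleotide sequence to `translate` and comparing the result, minus the final stop character, with the protein record is the natural extension of this check.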