Monarch geneset OGS2.0

DPOGS209573
TranscriptDPOGS209573-TA2184 bp
ProteinDPOGS209573-PA727 aa
Genomic positionDPSCF300015 - 1122789-1124972
RNAseq coverage13x (Rank: top 83%)
Annotation
HeliconiusHMEL0170510.077.94% 
BombyxBGIBMGA006633-TA0.069.89% 
DrosophilaCG13930-PA2e-7529.35% 
EBI UniRef50UniRef50_E2AHR74e-12135.87%WD repeat-containing protein 78 n=2 Tax=Formicidae RepID=E2AHR7_CAMFO
NCBI RefSeqXP_001810245.12e-11233.52%PREDICTED: similar to axonemal dynein intermediate chain inner arm i1 [Tribolium castaneum]
NCBI nr blastpgi|3228005282e-12334.23%hypothetical protein SINV_80528 [Solenopsis invicta]
NCBI nr blastxgi|3228005282e-12034.00%hypothetical protein SINV_80528 [Solenopsis invicta]
Group
Gene OntologyGO:00055152.8e-34protein binding
KEGG pathwaybfo:BRAFLDRAFT_2425262e-44 
 K10409 (DNAI1)maps-> Huntington's disease
InterPro domain[328-686] IPR0110462.8e-34WD40 repeat-like-containing domain
[526-686] IPR0159431.1e-18WD40/YVTN repeat-like-containing domain
[562-595] IPR0197818.4e-07WD40 repeat, subgroup
Orthology groupMCL12291 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209573-TA
ATGTCTAGTGGGAATTCGGAGTATTCTCAGCCGAGCACTAATATAATTACAGATGAATCAAGTGACTCTGAATCTACTGTAAAAACAAAAGCAGCTTATAGTTTTATGGAAAATAGAGAAAAGTACAAAGTGATAGTTGACGGTGTAGACTGTACACCAGACAATATTGTCGATAATGATTTTGTTACGATGCAAGATACTTTTACACATGCTGCTCTTGACAGTAGACCAAGAGCGCGGAGCGAGGTAGCGTTGAAGGAAGCCAAAAGTAGGACTAAATCTGCCTTAGATACGGGGATGAAAGTGTATGCTACTACAATCTCCCTTGACGACATTTATATCGACCACACACTGAATACTGAAGAAGGTTTTACGTCCGACGAACTAGATTTGGATATCGCTCCCTCCGCTTTTTATTTGCCGAAAATGAATCCTATAACCTCGTATCCACCTGAAATAACTATAACTTTAAAAGAGACAGAAACATATTTTCTGTTTGAATTACCAACAACTTCATATGAGAAAGGGACTTCAGAGGCCACATTGGTAGAAGAAGAAAATGAATTCTATCAATACATAACAGTAGGAAAAGGAAAAAACAGAAAGATGGTGACAGAAGAAACTCAATCTAAGGAATGTGTCTCGCAAACTAGACATACACTAGCTACAAGGCCACAAAAGAAAAATGCGATCAGTTTTGCTTCCATGTGGGATATGCATGACACTTACGCACGTCTTGCTCGAGTAAAGGCTGAAGAAAAACCTGACGAGATGGTGATGTATCAATCCGCCGCGCCTACTTTATTGCGAAAGAAAATAGACAAATATGACCCCAGCGACGAGCGACGCGGGAAAAATTTTGACGAAATTTATAACACCCCGCAGTTTTTGGATGCGATTCTTTTAACGGAGAGGGTTCTTTCAACCTTGAACTATGATAAAGAACAAAAGACATTCCGAGGTCTCGTGAAGATAGACCCGCTCTCGTTGGATTTAATTTATATATATAGTATGAAGCCTCTGTGGACCCTCGAATGTCCCGAAGCGGAAAATAGACCCATAACAAGCATTACATTCAATCCTAAAAATAACGATATTTTAGCCGTAGGACATGGAAAGTTTACTTATGCGGAAAAATTTACTGGATTAATATGCGTCTGGTGTACTAAGAATCCATGTAAGCCAGAGAGATTATATAACTTTCATGACCCTTTGACATCGGTAGCATTTGCAGACATAAATCCAAATTGGTTAGCATGTGGATTTTCTAACGGGGACGTCTTAATTTTAGATGTTATTTCTTATCCCATAAAAATAATTGCTAAAAGCAAACGTGATACAAATCCTTGTTTCGAACCGATCTGGACCACAAGTTGGCGGACCAGTGACAACGAGAATCAGTTCGTTATGACTACATGTCAAGACGGTAGGATCAATAGATTCACGAGCACAAAAACACACGACTTCATTTGCTCACCGATGATGCGCATATCAACCGTTGAGGGTAAAATGAAAGGCATCGAAGCGCCGAGACAGTGTCAAAAGGAAGACGTGCCGATAACGAGACATCCGGCAGCTTTGTGTATGAAATGGCATCCAAATGTAGACCATATTTACTTCGTCGGCACAGACGAGGGTTGCATCCACAAATGCTCGACGCACTACTTGAACCAGCATATGGATGTATTCAGAGCCCACTCTGGCCCTGTATACGGCATGGAATTTTCTCCATTCATGGATACTTTGCTGGTTACGTGTGGAGCGGACAGTGCGGTCAGGCTTTGGATGGAAGGGATAGATGACGTCATACTTACGATGAACTGCCCGACAGCAGTTTACGACGTAGCATTTTGCCCAGTCAATTCTACTGTGATAATATGTGTTAGTGGCAACGTACTATCTATTTGGGACCTCCGCAGAAAGAACCATATGCCTTGCGCCGAGTATACATTCCCAGGTCAAGTCGTTCTGACATACATAAAGTTCTCACCGTCAGGAGATAACGTGTTTGTCGGCGACACGTTAGGCCGCGTTCACACATTCCATTTGGAGGACACGCCTATAGCTCCATTTTATCAAAAGAAGCTATTGGATGAGACAATAAAGAAGGCCCTGTGTACACGTCCCCTCATATTAAAGCAGTTGGAGAAGTTAGAAAAATTCAGGGAGAAATTTGGAAAATAG

Protein sequence:

>DPOGS209573-PA
MSSGNSEYSQPSTNIITDESSDSESTVKTKAAYSFMENREKYKVIVDGVDCTPDNIVDNDFVTMQDTFTHAALDSRPRARSEVALKEAKSRTKSALDTGMKVYATTISLDDIYIDHTLNTEEGFTSDELDLDIAPSAFYLPKMNPITSYPPEITITLKETETYFLFELPTTSYEKGTSEATLVEEENEFYQYITVGKGKNRKMVTEETQSKECVSQTRHTLATRPQKKNAISFASMWDMHDTYARLARVKAEEKPDEMVMYQSAAPTLLRKKIDKYDPSDERRGKNFDEIYNTPQFLDAILLTERVLSTLNYDKEQKTFRGLVKIDPLSLDLIYIYSMKPLWTLECPEAENRPITSITFNPKNNDILAVGHGKFTYAEKFTGLICVWCTKNPCKPERLYNFHDPLTSVAFADINPNWLACGFSNGDVLILDVISYPIKIIAKSKRDTNPCFEPIWTTSWRTSDNENQFVMTTCQDGRINRFTSTKTHDFICSPMMRISTVEGKMKGIEAPRQCQKEDVPITRHPAALCMKWHPNVDHIYFVGTDEGCIHKCSTHYLNQHMDVFRAHSGPVYGMEFSPFMDTLLVTCGADSAVRLWMEGIDDVILTMNCPTAVYDVAFCPVNSTVIICVSGNVLSIWDLRRKNHMPCAEYTFPGQVVLTYIKFSPSGDNVFVGDTLGRVHTFHLEDTPIAPFYQKKLLDETIKKALCTRPLILKQLEKLEKFREKFGK-