Monarch geneset OGS2.0

DPOGS203950
TranscriptDPOGS203950-TA3537 bp
ProteinDPOGS203950-PA1178 aa
Genomic positionDPSCF300005 + 122188-149683
RNAseq coverage113x (Rank: top 59%)
Annotation
HeliconiusHMEL0120850.077.05% 
BombyxBGIBMGA000478-TA0.055.75% 
DrosophilaOseg1-PA0.041.25% 
EBI UniRef50UniRef50_Q6NYH10.040.76%Intraflagellar transport protein 122 homolog n=5 Tax=Coelomata RepID=IF122_DANRE
NCBI RefSeqXP_972704.10.044.01%PREDICTED: similar to intraflagellar transport 122 homolog [Tribolium castaneum]
NCBI nr blastpgi|910783120.044.01%PREDICTED: similar to intraflagellar transport 122 homolog [Tribolium castaneum]
NCBI nr blastxgi|910783120.043.97%PREDICTED: similar to intraflagellar transport 122 homolog [Tribolium castaneum]
Group
Gene OntologyGO:00055154.8e-38protein binding
KEGG pathwayamr:AM1_29443e-10 
 K02033 (ABC.PE.P)maps-> ABC transporters
InterPro domain[19-282] IPR0110464.8e-38WD40 repeat-like-containing domain
[20-283] IPR0159432.6e-34WD40/YVTN repeat-like-containing domain
[49-88] IPR0016801.4e-07WD40 repeat
[51-87] IPR0197813.8e-06WD40 repeat, subgroup
Orthology groupMCL11468 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203950-TA
ATGAGAACTGTTCCGAAGTGGGTAAATAAAATTCATGAAGCTGATAAAATTGAGTTTTCGAGTGTACACGCAATATGCTTCAGTCCAGATGGCACTCAATTAGTAGTAGGTGCTGGAGAAAAGGTCATGGTTTATGACCCAAGAGATGGTTCACTGTTACAACTTCTGCAGGCCCATAAAGGAATGGTTCATACTGTAGCTTATTGCAGTGATGGCAAAAAGTTTGCTAGTGGTAGTGCAGACAAAAATGTAATCATATGGACATCTAAAATGGAAGGTGTTCTTAAATATTCACACAGTGAAGCAATCCAATGTGTAGCATATAATCCGGTCACTTATCACTTAGCTTCATGCGCACTCTCTGATTTTGCATTCTGGTCAGCCGATGTCAAAGCTGTTCAGAAATATCGAGTGGCAGGTCGGATTACTAGTTGTGCTTGGGCAGCAACAGGTCAATACCTAGCTATTGGACTTGCTAGTGGCATAGTTTCAATTCGCAATAAGGTTGGTGATGAAATTACCAGAATAACAAGAGATGCTGCAGTGTGGGCTGTAGCATTCTATAAGAACACATTATTGGTTACAGACTGGAATGATACCCTATCATTTTATGATATGATGGGACAACCTCTTTTGAAAGAAAGAAATATAGAAATTTCGGCTGTTTCAATGACAATTTTGGGTGCCTTGATATTAGTTGGTGGCTTGGGGGGTTGGGCCATACTTACCTCAGAAGGAGTATCAATATTAAATACATCACTGGACTGGGTATGGTCTATTGCACCATCACCAATTACAAACACTATGGCAGTTGCATGTCAAGACGGAACTCTGTGGTGTTACCAAGTTGTCTTCAACACAGTTCATGGGCTATTTCGAGAGAGATATGCTTATAGAGAAAATATGACAGATGTTATTATACAGCACCTAACAACGGGCAATAAGGTTCGGATTAAATGTCATGATAGAGTGCAAAAGATTGCTATTTACAAACATCGTTTGGCGGTTCAACTCCCCGAAAGGGTAGTGGTTTATGAACAAGGTGATCCTGAGGGCATGTTATATCGCGTCAAGGAGAAGTTGGTTCAGAAATCTGAATGTTCGTTGTTGGTAGCCACCAGCGAATCCCTGTTACTATGTCAGGACACAAAACTGGTAATGATCGGTTTGAAGATACCAAAATCATGGACCGTCCCATCACCGATACGTTACGTCAAAGTTACTAGTTTATACTTTGAAGAAGTATTACTCTTAGGATTGCTTAATGGACAGATATGGCAAGTGGAGCCTAACAAAGGTACGGCTAGGATGGTTGTGCAGACTGCGGGTAGTGTCCGTTGTTTGGACGTGAGCGCCTCACGTGGCCGGCTAGCTGTCGTGGATGAGAACTCAGTCTGCCGCGTGTACAGCCTCCCAGCTGGGGATCTCCAATATACGGAGGAGAACGTGTCGTCAGCTTCGTGGAATTCATGGTGCGAGGAACTTCTCTGTCTCTCTGGTAACGGCTTGTTGTCAATCAAGGCGGGACAATTTCCACCAGCAACACAACCACTGGCCGGCTCCGTTGTCGGCTTCCAGGGTGGTCGTGTTTTTTGTCTGCAAGCCAATTTGATGCAGACTATCAATGTTCCTCTATCTCACGCTGTCCATCAGTACGTACAACAAAAGATGTTCAACGAAGCGTATGCAGTCGCTTGTATGGGAGTGACAGCATTGGACTGGGAGCGTCTTGGAACCGCTGCCTTAGAAGAGCTCTCCTTCGAAGTGGCTCGCAAGGCCTTCCAGAGGAGCGAGAATATTGTTTTATTATCGCTCATTGATCACTTACAGGAACGCTTGGAGAGCGGTGAGAAGAGGCAGGTTATAATAGGCGAGGTGTTGGCTTACCGCGGACGATACAACGAGGCGGCGAGGGCGTTCCAAACAGCTGCGAGGAATGACAAGGCACTGGCGCTCTATCTAGACCTTCGCATGTTTAATAAGGCACAAGAATATGTAGGCGAGGGCGAGGGTGTGACCAAACTAGCACGTCAAAGGGCGGAGTGGGCTCGAAGAGTCAATGAACCCAGAGCAGCGGCGGAGATGTACCTCGCGGCAGGAGATGTGCGCAGTGCTGCCACCATACTAGCAGAGAGCGGCCGACGGGATATGCTAATAGAACTAGCTCGCAAAATGGACAAAGGTTCCAGTGAATCTTTACGTCATCTGGCAGAGGCGCTCGTAACTGCTGGTGAATACCCCACAGCTGGAGATGTCTATCACCGACTAGGAGATTACAAGAAAATGGCTCAGTTAGCTGTGACAGCTGGCGATTGGGTACGTGCGTTCTCTTTGGCACGCGAGCACGAGGAATGCAGGCGTGATGTTTATCTGCCTCATGCGCACCGTATGGCCAGAGAGAACAAATTTGTTGAAGCCCAAAAAGCATATCATATGGCGGGTGAAACAGAAACAGCTATGCGTGTTTTCAGTATTCTTGTAAATAATGCTGTGGCAGAGGAAAGATTCAATGATGCCGGATACTTACATTACTTGCTAGCAACACAGTGCTTAGAATCAGCAACTGCAGCACAGAACAGGGATAGAGCTACATTGCTGCATCAGTACGCTCACAACGAGCGTCTGTCCCGTGTGTATAACGCATACGACTCGGTACATCATTGTGTTCACGAGCCCTTTTCGTTGTCTCAACCGGACGTCCTATTGAACGCCGCCAGGTACGTCCTCGCGCTCTTGGAGGAACCCCCGCCCGGACTCTCCATGTTTTGTTTGTACTTATGCTTGGCGAAACAAGCTAAAGTGCTCAATGCAAACACGCTGGCTCGTCAGATGCTCAATAAGATACACGGTCTCCAAGTGCCACCCAAGTTTCAGGAAGGAGTCGAGTTACTAATGCTGAACAGCGGAGCTAGTAGCTCGTCTGAATCAGAAGACATTTTGCCATTATGTTGGCGGTGTCGTAGCCACGTGCCAGCGCTGGCTACCACTTGTCCAAGATGCAGACATGTGTTGGCCCATTCGCTGGCAACTCACGAGGTGCTGCCGCTAGTTCAGTTCGAGCCGGCTGAAGGAATCACGTTTGAAGAGGCGATGGATCTTATAGAACGCACTCCGATACCGGAGATTGAAGGAGCTAATGAAGGCGCTGACATACTTAAGATAGATAACGACATAGACTACGCTGATCCGTTCCTTGATAAGGTCGATGAGGAGGACAAAGGCGTAGTAGTTTGCAGTCGTTTAGCTCTACTGAGATTGAATCCAGCCAGTCTAGTGATAGTGAACCGTCCCCCTCTTAAACCAGTCTTTTACCGTAACATGTTGCCCGAACTGCCAGTCACCACCTGCCCAGCCTGCTATAATCTATTCTACATGGAAGATTACGAGGTCCAAATCATCTCCAAGGGGCACTGTCCCTTTTGTAGACACAGTGCTGAAGTTGCATCAAATGAAGACGGTATAAACGACTCGATATTTAACGATTCGGGAACGTCTAGCCCTAACAGTGCCAGCAATGAGCAGTCATCATGGCACTAG

Protein sequence:

>DPOGS203950-PA
MRTVPKWVNKIHEADKIEFSSVHAICFSPDGTQLVVGAGEKVMVYDPRDGSLLQLLQAHKGMVHTVAYCSDGKKFASGSADKNVIIWTSKMEGVLKYSHSEAIQCVAYNPVTYHLASCALSDFAFWSADVKAVQKYRVAGRITSCAWAATGQYLAIGLASGIVSIRNKVGDEITRITRDAAVWAVAFYKNTLLVTDWNDTLSFYDMMGQPLLKERNIEISAVSMTILGALILVGGLGGWAILTSEGVSILNTSLDWVWSIAPSPITNTMAVACQDGTLWCYQVVFNTVHGLFRERYAYRENMTDVIIQHLTTGNKVRIKCHDRVQKIAIYKHRLAVQLPERVVVYEQGDPEGMLYRVKEKLVQKSECSLLVATSESLLLCQDTKLVMIGLKIPKSWTVPSPIRYVKVTSLYFEEVLLLGLLNGQIWQVEPNKGTARMVVQTAGSVRCLDVSASRGRLAVVDENSVCRVYSLPAGDLQYTEENVSSASWNSWCEELLCLSGNGLLSIKAGQFPPATQPLAGSVVGFQGGRVFCLQANLMQTINVPLSHAVHQYVQQKMFNEAYAVACMGVTALDWERLGTAALEELSFEVARKAFQRSENIVLLSLIDHLQERLESGEKRQVIIGEVLAYRGRYNEAARAFQTAARNDKALALYLDLRMFNKAQEYVGEGEGVTKLARQRAEWARRVNEPRAAAEMYLAAGDVRSAATILAESGRRDMLIELARKMDKGSSESLRHLAEALVTAGEYPTAGDVYHRLGDYKKMAQLAVTAGDWVRAFSLAREHEECRRDVYLPHAHRMARENKFVEAQKAYHMAGETETAMRVFSILVNNAVAEERFNDAGYLHYLLATQCLESATAAQNRDRATLLHQYAHNERLSRVYNAYDSVHHCVHEPFSLSQPDVLLNAARYVLALLEEPPPGLSMFCLYLCLAKQAKVLNANTLARQMLNKIHGLQVPPKFQEGVELLMLNSGASSSSESEDILPLCWRCRSHVPALATTCPRCRHVLAHSLATHEVLPLVQFEPAEGITFEEAMDLIERTPIPEIEGANEGADILKIDNDIDYADPFLDKVDEEDKGVVVCSRLALLRLNPASLVIVNRPPLKPVFYRNMLPELPVTTCPACYNLFYMEDYEVQIISKGHCPFCRHSAEVASNEDGINDSIFNDSGTSSPNSASNEQSSWH-