Monarch geneset OGS2.0

DPOGS208611
Transcript: DPOGS208611-TA | 4374 bp
Protein: DPOGS208611-PA | 1457 aa
Genomic position: DPSCF300052 + 392702-408723
RNAseq coverage: 30x (Rank: top 76%)
Annotation
Heliconius: HMEL016592 | 0.0 | 63.67%
Bombyx: BGIBMGA013427-TA | 0.0 | 66.86%
Drosophila: Ank2-PU | 0.0 | 51.98%
EBI UniRef50: UniRef50_Q16PP6 | 0.0 | 53.68% | Ankyrin 2,3/unc44 n=11 Tax=Coelomata RepID=Q16PP6_AEDAE
NCBI RefSeq: XP_001809144.1 | 0.0 | 54.82% | PREDICTED: similar to ankyrin 2,3/unc44 [Tribolium castaneum]
NCBI nr blastp: gi|189235951 | 0.0 | 54.82% | PREDICTED: similar to ankyrin 2,3/unc44 [Tribolium castaneum]
NCBI nr blastx: gi|189235951 | 0.0 | 54.44% | PREDICTED: similar to ankyrin 2,3/unc44 [Tribolium castaneum]
Group
Gene Ontology: GO:0005515 | 2.2e-07 | protein binding
KEGG pathway:
InterPro domain: [320-676] IPR020683 | 2.7e-111 | Ankyrin repeat-containing domain
InterPro domain: [954-1058] IPR000906 | 7.9e-50 | ZU5
InterPro domain: [120-149] IPR002110 | 2.2e-07 | Ankyrin repeat
Orthology group: MCL10139 | Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208611-TA
ATGGTGAGACATTTATTGGAAATGTTGAACCTCATTATTTATGGCATGGAGTGGGTCAACATACAATCCCAAAATGGTTTTACACCATTGTATATGGCGGCGCAGGAAAACCACGATGGAGTGGTTAAATTCTTGCTCGCTAATGGCGCCAACCAAAGCTTGGCCACTGAGGACGGCTTCACCCCCCTAGCTGTTGCGATGCAGCAGGGTCATGAGAAGGTGGTAGCTGTACTCCTCGAGGCTGACACGCGTGGTCGAGTTCGTCTGCCAGCGCTACATATAGCTGCTAAGAAAGACGATGTCAAGGCTGCAAATTTATTACTCGAGAATGAACATAATCCAGACGTGACATCTAAATCCGGATTCACCCCCCTCCACATTGCCGCCCATTATGGCAATGAGTCTGTTGCTCGGCTGCTTTTGGCTAAAGGTGCTGACGTCAATTGCGCCGCCAAACACAACATCTGTCCACTCCACGTGGCCGCTAAATGGGGAAAAGACAATATGGTATCTCTGCTCTGCGACAATGGCGCTAATGTGGAAGCAAGGACCCGTGATGGCCTGACACCATTACACTGCGCGGCTAGGTCAGGTCATGAGAGAGTCGTTGAAGCCTTGTTAGATAGAGGTGCTCCCATAACTAGCAAGAGTAAGAATGGTCTAGCGCCACTGCATATGGCTGCCCAGGGTGACCATTCTGAAGCCGCTCGCGTGTTGCTTTCTCGACGGGCACCTGTAGATGACGTCACAGTGGATTATCTCACCGCTCTGCATGTCGCTGCTCATTGTGGGCATGCGAAAGTCGCTAAGCTGCTTCTGGATAGGAATGCCGATGCAAACGCACGGGCTCTGAATGGTTTTACACCTCTCCACATTGCTTGCAAAAAGAATAGGATTAAGGTTGTGGAACTTCTCTTAAAATATGGAGCGAGTATCCAAGCAACAACTGAATCAGGCCTAACTCCTCTTCACGTCGCATCTTTCATGGGCTGCATGAATATTGTAATCTACCTTCTTCAGCATGAAGCTAACCCTGACGTTCCTACAGTTAGGGGAGAGACACCTTTGCATTTAGCTGCCAGGGCTAATCAGACGGACATAATTCGCATCCTGTTGCGCAACGGTGCGGCAGTAGAAGCGAAAGCACGTGAGAGACAGACCCCACTTCATATTGCATCTCGTCTTGGCAACGTGGATATTGCTGTGCTGTTGCTGCAGCATGGAGCGGACGTCAGAGCCATGACTGCTGATCATTACAACGCATTGCACATTGCTGCTAAACAGCATAACCATGACGTGGCAGCCGCGTTGATTGAACACAACGCTCCCTTAACTGCTACCACCAAAAAAGGCTTCACGGCATTACATCTTGCTGCCAAATACGGTAACCTGAAGGTAGCGAATCTTCTGCTTGCTCACGGGGCTTCCCCGGATCAGGCGGGCAAGAACGGCATGACACCTTTGCACGTGGCAGCGCAGTATGACCAACAGGCTGTTGCTAATACGCTACTAGAGAAGGGAGCAGATGCTAAGGCGGTAGCTAAGAATGGACATACACCACTTCATATCGCGTCTCGTAAGAACCAGATGGAAACAGCAGCGACATTGCTCGAATATGGAGCACTGACCAATGCCGAGTCTAAGGCTGGCTTTACACCTTTACATCTTGCTGCTCAACAGGGACACACGGAGATGTGCTCCTTGTTGTTGGAGCATGGAGCGGATGCTGGTCAGCAGTCTAAGAATGGACTCGCAGCGCTTCATCTAGCTGCACAAGAAGATCGAGTGGCTGTTGCACAACTCTTGCTTAAGAACGGCGCTGAGGTGGATATATGCACTAAGGGCGGTTACACACCATTGCACATAGCGAGTCATTATGGTCAAGCCAACATGGTGCGATATCTGCTTGAGAATGGCGCTTCTGTCAAAGCGGAGACTACCCATGGATATACTGCCCTCCACCATGCTGCTCAACAAGGCCACATTAATATAGTCAACAT
TTTGCTGGAGCATAAAGCTGACGCAAATGCTATTACTACGGGACACACGGAGATGTGCTCCTTGTTGTTGGAGCATGGAGCGGATGCTGGTCAGCAGTCTAAGAATGGACTCGCAGCGCTTCATCTAGCTGCACAAGAAGATCGAGTGGCTGTTGCACAACTCTTGCTTAAGAACGGCGCTGAGAAAGTCCACCTATATACCACTAAGGGCGGTTACACACCATTGCACATAGCGAGTCATTATGGTCAAGCCAACATGGTGCGATATCTGCTTGAGAATGGCGCGTCTGTCAAAGCGGAGACTACCCATGGATATACTGCCCTACACCACGCTGCTCAACAAGGCCACATTAATATCGTCAACATATTGCTGGAGCATAAAGCTGACGCAAATGCTATTACTACGAATGGTCAAACTCCTCTCGACATAGCGTCTAAGCTCGGCTACGTCACAGTAATGGAGACCCTTAAGGAAGTTTCTGAACCCTCAATAGCTCCCGCATCTCAGGACAAGTATAAAGTGGTCGCCCCAGAAACCATGCTGGAAACTTTTATGTCAGATTCAGAAGAGGAAGGAGGGGAAGATACGATCCTCAACGACCAGCCGTATCGATATCTTACCGCAGACGACATGAAGTCTCTAGGTGATGATTCTTTGCCAATAGACGTCACCAAGGACGAACGTACAGAGTCCGCTATGAGTCACAAAAATATTATGGAGATATCCCAAGGATCAGTCAATGGTATGCCATACCAAACCCAAGAAGTGGTCGTGAAGAGTGTGAGTCGCTACAGCGCAGCTCCTGAAGGCTACTGCTATAATGTGGATCCCACCCTACCCAAGAAAAAATTACAATGGAAAAATTTCCTGGTTTCATTCCTGGTAGACGCACGTGGAGGTGCGCTCCGCGGCTGCCGTGGTGGTGGAGTGAGAGTCATCGTCCCGCCTTTATCTGCTCAACAGCCAACTAGAATCACTTGCCGATATTTAAAACCATCTCGGATCAACCATATGCCACCTCTGATGGAGGGAGAGGCACTTGCTTCAAGAGTTTTGGAAATGGGTCCGGTATCAGCAAAGTTTTTAGGGCCAGTAATCCTAGAAGTACCCCACTACGCATCGCTCCGTGGTAAAGAACGTGAAATCGTGATTCTCCGTTCAGACAATGGAACCAGTTGGCGTGAACACAACGCCGATGCTACTGATGATGTAGTCCAGGATATACTGCATGAGACACTAGAGATTGAAGAAACTAATGAAGATGAAAAGTCCTGGGACGCTCCAAGGGTGACTCGCATCCTCACACACGACTTCCCTCAATACTTCGCGGTCATCTCCCGCATCAGACAGGAGGTGCATGCCATAGGACCTGAAGGTGGCATGGTATCCAGTTCAGTTGTGCCTCAAGTACAAGCAGTGTTCCCACAGGGAGCTTTGACCAAGAAGATTAAAGTAGGCCTTCAGGCCCAAATAATAGACTCGGAATTAACTGCAAAGCTATTAGGTCGTGGAGTAGCTGTTTCACCTGTAGTTACCGTAGAGCCAAGACGGCGAAAATTCCATAAGGCCATAACCCTCAGCATGCCAGCCCCTAGACCACACACCCAAGGAATGACCAATCAATACAGCACATCATCAGCGCCAACCTTGAGACTTCTTTGTTCAATATCAGGTGGAACCAACAGGGCGCATTGGGAAGATGTGACTGAAAATACTCCGTTGACATTCGTAAATGATTGTGTTTCCTTCACAACAACGGTCTCTGCACGTTATTGGCTCATTGATTGTCGACATGTGGAAGACGCCACCAAAATGGCTACTGAATTATATCGGGAAGCAATCCACGTTCCGTTTATGTCACGATTTGTTGTTTACGCGAAACGAACTGATGAATGTCAAGCGCAGCTTCGCATGTTCTGTGTCACTGATGATAAAGAAGACAAAGCCTTGGAGCGAATTGAAAGATTCATCCAAGTTGCTAAGAGTCGTGACGTAGAGG
TACATGAAGGTAAACCGGTCTATCTAGAATTTGGAGGCAACTTAGTTCCTGTGGCTAAATCAGGCGAACAACTTTCAATACCGTTCCGAGCTTTCCGTGAAAACCGTGTCGCTTTCCCGGTTATGATTAAGACTCAAGACCTGGAACCCATCTGCAGATGTCAGTTTATGAGAGATCCAAAAGTACCAAAAGGGGAACCATCACCAACACCTATCGCTATTCTGAATATCATGGTACCGGATGATTTGCCAGTGGAAAGAATCTCACCAGTACCATTGGACATGATGCCGCGCCGCACAGAAGAACAAGAACTTATTTGGAGGCAGAGGCTTCGAGATCCTAGGAATTTGCCTCTACAAAAAGCAGACATTTGA

Protein sequence:

>DPOGS208611-PA
MVRHLLEMLNLIIYGMEWVNIQSQNGFTPLYMAAQENHDGVVKFLLANGANQSLATEDGFTPLAVAMQQGHEKVVAVLLEADTRGRVRLPALHIAAKKDDVKAANLLLENEHNPDVTSKSGFTPLHIAAHYGNESVARLLLAKGADVNCAAKHNICPLHVAAKWGKDNMVSLLCDNGANVEARTRDGLTPLHCAARSGHERVVEALLDRGAPITSKSKNGLAPLHMAAQGDHSEAARVLLSRRAPVDDVTVDYLTALHVAAHCGHAKVAKLLLDRNADANARALNGFTPLHIACKKNRIKVVELLLKYGASIQATTESGLTPLHVASFMGCMNIVIYLLQHEANPDVPTVRGETPLHLAARANQTDIIRILLRNGAAVEAKARERQTPLHIASRLGNVDIAVLLLQHGADVRAMTADHYNALHIAAKQHNHDVAAALIEHNAPLTATTKKGFTALHLAAKYGNLKVANLLLAHGASPDQAGKNGMTPLHVAAQYDQQAVANTLLEKGADAKAVAKNGHTPLHIASRKNQMETAATLLEYGALTNAESKAGFTPLHLAAQQGHTEMCSLLLEHGADAGQQSKNGLAALHLAAQEDRVAVAQLLLKNGAEVDICTKGGYTPLHIASHYGQANMVRYLLENGASVKAETTHGYTALHHAAQQGHINIVNILLEHKADANAITTGHTEMCSLLLEHGADAGQQSKNGLAALHLAAQEDRVAVAQLLLKNGAEKVHLYTTKGGYTPLHIASHYGQANMVRYLLENGASVKAETTHGYTALHHAAQQGHINIVNILLEHKADANAITTNGQTPLDIASKLGYVTVMETLKEVSEPSIAPASQDKYKVVAPETMLETFMSDSEEEGGEDTILNDQPYRYLTADDMKSLGDDSLPIDVTKDERTESAMSHKNIMEISQGSVNGMPYQTQEVVVKSVSRYSAAPEGYCYNVDPTLPKKKLQWKNFLVSFLVDARGGALRGCRGGGVRVIVPPLSAQQPTRITCRYLKPSRINHMPPLMEGEALASRVLEMGPVSAKFLGPVILEVPHYASLRGKEREIVILRSDNGTSWREHNADATDDVVQDILHETLEIEETNEDEKSWDAPRVTRILTHDFPQYFAVISRIRQEVHAIGPEGGMVSSSVVPQVQAVFPQGALTKKIKVGLQAQIIDSELTAKLLGRGVAVSPVVTVEPRRRKFHKAITLSMPAPRPHTQGMTNQYSTSSAPTLRLLCSISGGTNRAHWEDVTENTPLTFVNDCVSFTTTVSARYWLIDCRHVEDATKMATELYREAIHVPFMSRFVVYAKRTDECQAQLRMFCVTDDKEDKALERIERFIQVAKSRDVEVHEGKPVYLEFGGNLVPVAKSGEQLSIPFRAFRENRVAFPVMIKTQDLEPICRCQFMRDPKVPKGEPSPTPIAILNIMVPDDLPVERISPVPLDMMPRRTEEQELIWRQRLRDPRNLPLQKADI-
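
Consistency check (illustrative sketch, not part of the original record): the reported lengths agree with each other, since 4374 bp is 1458 codons, which is 1457 amino acids plus one stop codon. The Python below is a minimal sketch of that check, assuming the standard genetic code applies to this transcript. The names `translate` and `expected_protein_length` are hypothetical helpers introduced here. The sketch writes the stop codon as `*`, whereas the record above writes it as `-`.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order.
# Assumption: the standard code is the right table for this transcript.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon; '*' marks a stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks inside the sequence
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


def expected_protein_length(transcript_bp: int) -> int:
    """Amino acid count for a transcript that is pure CDS ending in one stop codon."""
    codons, remainder = divmod(transcript_bp, 3)
    if remainder:
        raise ValueError("transcript length is not a multiple of 3")
    return codons - 1  # the final codon is the stop


if __name__ == "__main__":
    print(expected_protein_length(4374))
    print(translate("ATGGTGAGACATTTATTGGAAATG"))  # first 8 codons of DPOGS208611-TA
    print(translate("GCAGACATTTGA"))              # last 4 codons of DPOGS208611-TA
```

Passing the full DPOGS208611-TA sequence to `translate` and comparing the result with DPOGS208611-PA (after replacing `*` with `-`) would extend the same check to the whole record.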