Monarch geneset OGS2.0

DPOGS208619
Transcript: DPOGS208619-TA, 3321 bp
Protein: DPOGS208619-PA, 1106 aa
Genomic position: DPSCF300052 + 645397-650549
RNAseq coverage: 203x (Rank: top 47%)
Annotation
Heliconius: HMEL016584, 0.0, 65.14%
Bombyx: BGIBMGA005723-TA, 0.0, 48.73%
Drosophila: (no entry)
EBI UniRef50: (no entry)
NCBI RefSeq: (no entry)
NCBI nr blastp: (no entry)
NCBI nr blastx: gi|150401140, 3e-06, 21.56%, SMC domain-containing protein [Methanococcus aeolicus Nankai-3]
Group
Gene Ontology: GO:0005515, 1.8e-06, protein binding
KEGG pathway: (no entry)
InterPro domain: [663-727] IPR009057, 1.8e-06, Homeodomain-like
Orthology group: MCL16949, Insect specific

Nucleotide sequence:

>DPOGS208619-TA
ATGGAGCAAATCGTGGTTAAAACTGAGGTGCAACCAAACGGAGACATACTACTATTTTATGTTGATGAAAATGAAGAAGCAGATGCTGGGACATTTGACCATGAAAATGGGAATGTCATGCATATGCTGCCGGAAACAAAATATGTGATAGAAGATAACGGAGAAGGTGAATCTGCCGATGAACTGGATATAGCACAAGCCGCTGAAGAAGTTGCCAAAGAAAATTGGCTGAATGAGGAAATAAAACGTCTTATTGTTTTTTATATTGATAATAAAGAAACATTTCTTAGTGGGACAACAAAGAAAAAACATTTGTGGGCTGTAGCTTGCAGAACTATACTCACAGGCAAAAATCCAGTTTCATGTGAGGTTAAATTACGAATGCTCAAACAACAGTTCATACAACTATGTTTAGAAAAACAGAAAGGAAATATTGTCACTTGGCCCTATTACGATCTTTGCCATCAAGCATTCTATGAAGATAGCTTCGAAACTGTCACAAATGATTCCGAACCAAATAAAGTTGTCATTAACATGCCCATGCAGAATGTTGTCAATCAAGATGGTATCTTAGTGGTAAAGAAAATTAATAATGGCCAGGGTAAAGACGTTGATGAAAAAGTTGAAGCTATGCTCAAACTTTATATAAAGCATAAGAAAACATTCAAGAGGAATAATCATATGCCGAGAGGTTTGTGGGAAACCATAGCTTTAGAAATGGGTGAAGACGACGTTGAATACTGGCACAAAAGGTTTCTGAATTTCAAACAGCATTATGTCAGAATGCTTTATAAAAAGAAAGAGAGCGGCCCTGAAAATATCCATTGGCCCTATATGAAATATTTCGATCACATTTTCTGTGATGATGAGGAGTTTCAGAAGAAATTTCTCCCCAGCAATGAAGATGAAGACAGCCAAGACACTGAGCTCCAAGAAGATGTATGGAGTGATACAGAAAAAACTTTTCTAGTCAAATATTACTATGATTGCTTCCAAGAATTTCAAGATTCGACCATACCGAACAAGTTTCTATGGCAGGAAGTCGGCAGGTTACTGGATAGAAAACCGGATGTCTGCAAGGATAAATACAATGAGCTAAAGAACCAACATTTCAACTTACTCCTAGAGAGAGTCTATGATATGACAAATAGAGTTCCGATGGCCATAATTTTTGACAATATAATCGCAAAGGAGACGTTGAACGAATTAGATCAGCCAGCGAGAAAGTCGGATTCATCCGACATATGGAAAACTGAGCAAATTGACGAATTGGTTGAATATTTGTATGAAAATATAAATATGTTAAAAGATCCCATATGTTACTACGTCTGTTGGGCCACGATATCTAAGAAAATCGATAGAACGGTACAATCCTGTATGAGACAGTGGGATTCGTTAAAAACTCTTTATAAAAATATCTTGGAGGATAAGAAGGAAAATTCTGACATGCAAATAGATTGGAGATACATAGATCTGTTTGATAGGATATTTGACTATGGCATGGATACCAACTTGCTTGACGGCTACGAAAAAACGAAAGAACATAGTAATGACAGTAGTAACGGAAAAGTTGGGGTTAAAAAGATCACAATTAAAGGAGATTACGAAAATTTTAATGTGGATGGTACGGACGATGAGGAGTCGTACGACGAGAGAGGTTTCACGAAACGTTCCAAGAAAAGAAACGGTGACTCGAAAGCGTTCAAGATATTAGAGTACTATTTGAAGAACAAAGACAAATTCTCATCGTCCCAACAAAAGAAACTCGCACTATGGGAAATACTGGCGAAACAAATTGGCCTATCAGCCACAGAATGTGCCCATAGATTTCGTAATTTCAAGCAAGTTTATATAGGTTACGTTCAACGAGAAATAAATAAACCGGAAATGCCAATATTATGGCCATATTACACATTATGCAAGAAAGTTTTCGGATACAGGGCTATAAAGTCCAAGTTAAAGAATGGAAAACTAGAATCTGAAGACGGGGAAGATTGGTC
GGCCAAGGAAATAAAGCAATTGATAAACTATTTCTCAAGAAACTATCACGATTTGGTTGATAATATCGAAGATAAATCCAGATGGGTCCAAGTAGCACAGGAAATGGGCAGAACTGAGGGTAGTTGTTGTGATAAGTTTTTAGAATTAAGAAAGTCCTATAGGAAATTGAAGACGATGAAGTCCAGGAATCCAGATGTCAAGGTGTCATGGAAGTATTTCAATATGATAGACGAGATATATCAAAACGGCGCCGAGAATATCGAGGTCTTGGAAGAAATGGATTTTGATGAATCAAGTGCCTTTGATGTTAAGACGGAAATTCAGGAAGACGATGACTTCCAATGCATTATAGTTGTACCAGAAGGGGATGACATTAATAACGCCCAGATTATAATACAAGAAGAGAAAAACTCGACAGAAGACAACACGGAGACAGCGGAAGTTAGGAAGAATATTACAGTATGGAATAGAAGAAGCAAGAGACGGTTACTGATATTATATCTGAAATACTTAAGATTGAATAAAGAAACGGAAATCAATGCAAAAGACATGTGGACAGAGATCGCTTCACAGTTAGAAGCCAAAACACCATCGTCCTGCAAGAAAATGTACATAAAACTGAAGAATCAATACAATATTGATAAAGAAAAAAACATTATAACACCCTACAATGTCGTAATGGAAAAAATCTTGGCTATAAAACCTAGGTTTGCTAAAACCAATAAAATTAAATCATTAGATGACATAACTGTTTACAATGATGTACCAATGTCTGAAGAGAAGGTCCTCAACGCTTTAAATTACTATATGCAGAATCTAGAAGACTTTGTGAGTCCCAAATTCGAGAAGAGATATCTATGGATCGAGCTTGCCAAATATATATCTGAGTCGGTGACGAATGTATATAGCAAAATAAATTATATGAAGCAGAATCATGATACTATTGATTCACCATTCCGTGAAGTTGTTCAAGATATCATTGCCAAGGAAAATACATTGAAGGAAGTTAACAAAGATATGAGTACGGTCAAAGACGATGAACCCGAAGGACAAATCTGGTCTGATTTGGAGATAGAACGCCTCTTGACCTGGTATCTGGCACATTTGGACAAATTCAAGAATCCAAAATTCGTCAGAAGCTACCTTTGGATGGAGGCGTCGGATATACTGAAAAAAAGTCCACTCGTCTGTTCAAAGAAAATGTTAGAAATAAGATCACAATACAGAAGTATGGTTAAGGAAAATCCAGAAGAATTAAACAATTGGAAATTTTATAATCTATGTCAGAGAATTTACGGAACCGGAAAGAAGAGTAATTGA

Protein sequence:

>DPOGS208619-PA
MEQIVVKTEVQPNGDILLFYVDENEEADAGTFDHENGNVMHMLPETKYVIEDNGEGESADELDIAQAAEEVAKENWLNEEIKRLIVFYIDNKETFLSGTTKKKHLWAVACRTILTGKNPVSCEVKLRMLKQQFIQLCLEKQKGNIVTWPYYDLCHQAFYEDSFETVTNDSEPNKVVINMPMQNVVNQDGILVVKKINNGQGKDVDEKVEAMLKLYIKHKKTFKRNNHMPRGLWETIALEMGEDDVEYWHKRFLNFKQHYVRMLYKKKESGPENIHWPYMKYFDHIFCDDEEFQKKFLPSNEDEDSQDTELQEDVWSDTEKTFLVKYYYDCFQEFQDSTIPNKFLWQEVGRLLDRKPDVCKDKYNELKNQHFNLLLERVYDMTNRVPMAIIFDNIIAKETLNELDQPARKSDSSDIWKTEQIDELVEYLYENINMLKDPICYYVCWATISKKIDRTVQSCMRQWDSLKTLYKNILEDKKENSDMQIDWRYIDLFDRIFDYGMDTNLLDGYEKTKEHSNDSSNGKVGVKKITIKGDYENFNVDGTDDEESYDERGFTKRSKKRNGDSKAFKILEYYLKNKDKFSSSQQKKLALWEILAKQIGLSATECAHRFRNFKQVYIGYVQREINKPEMPILWPYYTLCKKVFGYRAIKSKLKNGKLESEDGEDWSAKEIKQLINYFSRNYHDLVDNIEDKSRWVQVAQEMGRTEGSCCDKFLELRKSYRKLKTMKSRNPDVKVSWKYFNMIDEIYQNGAENIEVLEEMDFDESSAFDVKTEIQEDDDFQCIIVVPEGDDINNAQIIIQEEKNSTEDNTETAEVRKNITVWNRRSKRRLLILYLKYLRLNKETEINAKDMWTEIASQLEAKTPSSCKKMYIKLKNQYNIDKEKNIITPYNVVMEKILAIKPRFAKTNKIKSLDDITVYNDVPMSEEKVLNALNYYMQNLEDFVSPKFEKRYLWIELAKYISESVTNVYSKINYMKQNHDTIDSPFREVVQDIIAKENTLKEVNKDMSTVKDDEPEGQIWSDLEIERLLTWYLAHLDKFKNPKFVRSYLWMEASDILKKSPLVCSKKMLEIRSQYRSMVKENPEELNNWKFYNLCQRIYGTGKKSN-
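
The transcript and protein records above can be cross-checked against each other: 1106 codons plus one stop codon give 3 x 1107 = 3321 bp, which matches the stated transcript length. The following is a minimal sketch of such a check, not part of the original record. It assumes the standard genetic code and assumes that the trailing "-" in the protein sequence marks the stop codon; the `translate` function and its `stop_symbol` parameter are hypothetical names introduced here for illustration.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
# Assumption: the record uses the standard nuclear code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence, writing stop codons as stop_symbol.

    Whitespace and line breaks are removed first, so a sequence that is
    wrapped over several lines can be passed in directly.
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First eight codons and last four codons of DPOGS208619-TA.
print(translate("ATGGAGCAAATCGTGGTTAAAACT"))  # MEQIVVKT
print(translate("AAGAGTAATTGA"))              # KSN-
```

Passing the full DPOGS208619-TA sequence to `translate` and comparing the result with the DPOGS208619-PA sequence would extend the same check to the whole record.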