Monarch geneset OGS2.0

DPOGS206105
Transcript: DPOGS206105-TA (5877 bp)
Protein: DPOGS206105-PA (1958 aa)
Genomic position: DPSCF300028 + 347981-359090
RNAseq coverage: 535x (Rank: top 24%)
Annotation
Heliconius: HMEL003919, 0.0, 91.79%
Bombyx: BGIBMGA006830-TA, 0.0, 89.55%
Drosophila: Nipped-B-PE, 0.0, 44.94%
EBI UniRef50: UniRef50_D6W7B2, 0.0, 53.21%, Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6W7B2_TRICA
NCBI RefSeq: XP_971316.2, 0.0, 53.21%, PREDICTED: similar to delangin [Tribolium castaneum]
NCBI nr blastp: gi|270014671, 0.0, 53.21%, hypothetical protein TcasGA2_TC004719 [Tribolium castaneum]
NCBI nr blastx: gi|350407686, 0.0, 53.90%, PREDICTED: nipped-B-like protein-like [Bombus impatiens]
Group:
Gene Ontology: GO:0005488, 1.8e-29, binding
KEGG pathway: tca:659958, 0.0
 K06672 (SCC2, NIPBL) maps-> Cell cycle - yeast
InterPro domain: [824-1680] IPR016024, 1.8e-29, Armadillo-type fold
 [1492-1672] IPR011989, 5.6e-07, Armadillo-like helical
Orthology group: MCL13119, Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206105-TA
ATGAATGAGAGGGATATTCCAAGTGTACCCATTACCACATTAGCAGGAGTAGCTAGTTTAACTGACTTGTTACCAGTGCTTCCATTACCAACACCTTTGCCGTCAACACTGAACAACAAGTCTCAGTTGTTTCACCCTCGTGTTGCAGAAGAAGCAGCTATACTGCTTGCAACTCGTGATGACAATCTTGTCTCCTTACTGTTACAGTCACTTATGCAGACATCTGTTCAGCATATTGACTTGAAGGATGAGAGTATACCAAATGCAGTTAGTCCTGCTCCTCCACAAGCAACTCCAGAACTTTTGAAAGCAATATTACATGTTAACCCCAATGTCTTTGATCAGCAACAAAGAAATCACTGGGGTAACCATTTACATGCATCCAAGTTGAATGGGATGTCACCCTCTTCCACATACAATTATGCATCACATAACACTGTTCACTCGAGTCCTTCATCACATGGTACTGCTGTACATTCAGGGTCTATGCCTAACCAGCAAGTGTTGGCGACACCATGCCAGTCATCCCCTCATGTAAACATGGGTTCCCCACCCTCTCAAGCTCGACCAGGGAGTCAAGTTAATATCACCGCTCAAAATAATGTTTCACCACCTTGGAGTACTTCGAACCCATCTAATATATACACAAGCCCTTTATCGAATTCTTCGTGTTCGAATCCTCAAAATGGTGGATTTTATAATAGTTTAGTGCAAGAAAATAGAAATTTTCTCACTAACATTGGTGACGATAAGGATCCACTTTCTATGTCTCCGAGAAATATGACTGAACAACAGAAGCAGCATTTACAAGTCCAGGATAAGAATTACGAACAAGTTCCAGGACAGTCTCCAGCTCAAGCTAGCGTGGCGCCTTCGCCTTTGCGACAGGAACCTCAAGTGCCTCAGCCGTGTATGGGTGCCCCTTACGCCCCTCCAGCTACACAAAGCACACAAGAGAAACAAGACCCCAGCAGAAGCACACAAGTTATGAACACTAGATACCCTGTTGTTAAATTAGGTCGACTCTCTGAGGGAATGCTAAAGAAACACGATGTGTCTCACGACGAAAGGTCGAGGAAAAATAGTACAGTGGATTCGGATTCTGACGACAATTTGCCTTTAAAAGCTAAGTCGAGTAGCTCCACTCCCACTGTAGATAAAGAAAGAGAAGCTATAGATATGGCTAGAGAAAAGAACTGGGAAGCGGTTAAGAGGAAACATGAAGAAGCCAGTAGAAGAGAAGAAGCAGAGGCGGCTATCGTGAGACCCAAGCTGCGAAAAGTGGAAAGAAGATTGGTGCCAGTTCTGGAAATGTTATCTGTTGATGAACTCATGGAGACAAATACATATCAACGGTTCAACAAATTAATAGAATCCGTCTTCGAAACCATCGATGATGACGTCATTATCACGGACGAAATGGAGGGTACAGATATTCCACAAGATATGCTATTACCTCGATATCAACTACAAGAGTTGTGTTCCGAAGCGGCGAAATTAAAAAATTTGGGTGCTATGGAAGCCATTCCTTCGGACAGATTAGTACGATTGTTAAACATATTAGAAAAAAATATTAGGGCTGCCGAAAAGATGTCTCTAGTGGGGGATCCAGAAGATAGTGAAGAGATGCGACAGATCTGGATGGAATCGGCGTTAGAGCGTGTGATGTGCGCCTCGGACGCCTGTCTCACGGCATTGTATATAATGACGTCACCCAGCATGCCCAAACGAATATTCCTAGAAGATGTTATAGATAGGATCATTATGTTTATTAAGTTTCAACTAAACAATACCATATACTGTGTATATGACCCCGTGTACAGTATCCAGAGTACTTCCAAGAAGAAAGTGGACGGCCGTAAGCGTCGCGGTGGCGGTGCAGGTCACGCCAGTCGGCGGTGCGGTAGTTCGACCAGAGCGGTCCGCGAGCTGTACACTCACGCTCACGAGTGTGTCACGTTGTTGTCTGAACTGTTCGCTGCCCACCACCTCACTGACAC
CACAGTCCTACACGCCTCTACCGTCGGAGTATCGCCGTTCTTCGTAGAGAACATCAGCGAACTGCAGCTGTCAGCCCTCAAGCTTGTTACAACTATTTTTACAAAATACGAACAACATCGCCGGCTACTGCTTGAGGATATCTTAGCTTCAATAGCTCGGATTCCTAGCTCGAAACACAATCTACGTTCCTTCCAACTGAGTTCCGACCAGCACATACAGATGTTGACCGCCCTTGTTTTACAATTGGTTCAGTGTGTTGTCACGCTCCCGGAGACGCTCTGTAAGACACAGGACAAGGATAAGGATAAGGAGAAGGAAAAGGAGCATGTTGAGTCTGATAGTAAAAAGCCCGTGGATAAAGACCTAACAATAATATCTAAATACGAAGCAGCAATCAGTGTAGGCGGAACATTTCTCACATCATTTCTAAACAAGTGCCGCAGTCGGAACGAAGAAGTCGACTTCAGGCCCCTGTTTGAGAACTTCGTTCATGATCTTTTAACTACTGTTAATAAACCCGAATGGCCGGCTACAGAATTGCTGCTAAGTTTACTCGGTACAATGCTGGTGAAGTACATGTCAGACAAATCAATGGAGATGTCTGTACGTGTGGCGTCGTTAGAGTACCTGGGGTTGGTGGCGTCGCGGTTGAGGAGGGACAGTGTGACGTCACGAGCCAAGCTCGCCACCATGGACGCCGTGGTGAGGGACATTCGGGCCGAAGAGGAGAAAGATGGATGTCAGCAGCAGTCGCTGACGTCAGGTTTAGACGAAGATGAAGAGAGAACAGAATTTCTTCAAAGGGTATTGCTTGATTACCTTGCGATAAATGGTCAAAAGGATCAGGCGTGGAATTGTGCCAGGCATTTTTATATTACGCAATGGTACCGAGATATGGTCGTCCAACCGAAGAGCTCCTCACCAACAAAAAGACCGAAGAATAAGTCAAAGAAAAAATACAAAGTCGAAAGCAGTGAAGAGGAATCTGACGCAGACGACGACAGCGACGGTGGCATCAAGGAATCGAAGGCCGACAAGAAACTACCTCCGAGCGCTCTGATGTCCGCCGAGAAGTTCAAAACCATTGAACGGCGGAAGCATTTCTTCCTCGAAAAGATTAGACCGTTCCGTTATCAAGGCGGGACCCAGGTGCAAGTGATGCAGTCCTACATAGATTATAGCGGCGCGGAGTTGATCTCGCAGTATCTGGCGTCGAAGCGATCTTTCTCCCAGAGCTTCGACAGGTATCTGAGGAAGATCCTGGTGATTCTCTGCGAGAATACAATAGCAATTCGTACGAAAGCTATGAAGTGTTTGACGATGATAGTAGAGGCGGATCCGGCGGTGCTGGCGCGTCCCGACATGCAGATCGGTGTGAACCGCTCGTTCCTAGACCAGTCCACGGCAGTACGGGAGGCGGCCGTAGACCTTGTAGGAAAGTTCGTGTTGAGTCGACCAGAACTCATCGACAAGTATTATGGAATGCTCTCAAATAGAATATTGGACACCGGTGTGTCGGTGAGAAAACGAGTGATAAAGATATTGAAAGACATTTGTATAGAATGTCCAGAGTTTCCAAAAATACCTGAAATATGTGTGAAAATGATAAGGAGGGTTAACGATGAAGAGGGTATAAGGAAGCTAGTTATGGAAGTGTTTCAAAATATGTGGTTCAGTCCGTGTCCCAACTCTTCGAGGCACGGTGCGCTGGATATAACAGCCGCCACGGCAGATCCGCTCACTAGAAAAGTACTGAACATAACAGATGTTGTACTATCGTCTAGAGATATGGGATTGGAATGGTTTCAACAACTGTTAATGAGTTTATTTAAACCTAAGGAAGATAAGGATGACTCCACAAAAATTATTTATCAACCGCCTAAATCTCTATTGGTAGCTTGCCAACAAATTGTTGACTGCCTTATCGAACACTTACTGCAGTTGGAAGAAACCAATACGGATGGAGCTGGTTCGTCACAACGTATTTTAGCGTGCCTCT
CCACATTGCATTTATTTGCAAAAATTAGACCGCAGCTATTGGTTAATCATGCATTAACACTCCAGCCTTACCTCAGTCTAAAATGTCAGAATCAATACGAGCAACAAATAATGTCAACGGTTGCTTCCACGTTAGAGCTGGTGGTTCCTCTCATGGAACATCCTAGTGAAGTATTCCTCGCTCAGTTGGAGGAAGACGCCGTGAAGCTCATCTTGCAGCGCGGCCAGCTCGTCATAGCTGCCTGTATAGCATGTCTAGCGGCCATTGTTAACAACCTGACGCACAATTATAAACTCATTAGGGATGTTTTCAATAAGTATCACGGAGTTCTACTGCAATGGAAGCAGAGTTGGCAGCGGAACGCCGAGATGACACGCGGGCTTCACACGAGGCCTCACTTTAGACGAGCGTTGTTCATTGTAGGACTGCTATTGCGGTATTTTGATTTCACTGAAAGCAGAGTCATAGAGGGTCTTGCGACTGACATCAAGGAGCAAGTGTACTCTACGTTAATGTTTTTCGTCGGTCTCGAAGACGAAGATTTCGTGTCACATACGCTCAAAGCCCTCGGTTCAGTTTGTGTGCGACATTACGAGTTTATGTTGAGGCCGGAATTGAAGGAGTTTTATCATCAGCTATTGACATCAGAACTGGCACCCATAGAAATGAAAGCAGACGTGCTTAGAAATATTGAGATGTACTTACAAGAGGAGGAACAGAGAATGATCAGACAGGATAAGGAATGGTCAAAAAGATCAAAACATGAAAACTTAAAGGAGATGGGCGACGTGTCTTCCGGAATGGCGTCAACAGTGATCCAATTGTACCTTAAAGAAATCCTGGGATCATTTTTGCACGCGAGCACGGTAGTTCGCTCCAGCGCTATGAAAGTAGTGCAGTTGGTGTTAGCACAAGGGCTGGTCCATCCTGTACAGATTGTTCCATATCTAATATGTATGAGCACTGACACAGAGGTGACAGTGTCACACACGGCAGACAAAAATCTCCAAGAAATAGATAAAAAATATCCAGGATTTATACACATGAAGGCTCAACTTGGTATAAAACTGTCGTACCAGCTACAAAAAATATTACAGAATAACAAGAAGGGAGTAATTAGAGGGTTCAGGAAAAAGGAACAAGACGACTTGCCAACCGCTCTTAATGGTTTTTTATATTCACTGTTAAGAAACACGCGGCCGCAGAGACGAGCATTAATACTATCCTTATTGAAACAATTCGATGACGTTTCTACCGCTCCATTGGATCAAATGCTCTATCTGGCTGATAACTTAAGCTACTTTCCTTTTCAAGTTCAAGATGAACCACTTTTTATAATACACCATATTGATATTATAATTTCTACGTCGGGATCAAATTTATTACAAATTTTTCGGGAGGGTCTTTTAAAAACTGGTAGTGAAGAGAAGGAGCCGCTCGACGAGGAAGAGGACGAGGAGGAGGCGGAGGCGCTGGTGGCGAGACTGCCGCCCTCCACCAGGCCGCTGAGGGACGCCATGAGACAGGCCCGAGGCTGTCTGCTCTTACTCGTTCTCAAACAACATCTCAAACAACTCTACGGTTTCACAGACGCTAAAATTAGTCAGTATTCACCGTCGGAGAGCGTGAAGGTGTACGAGAAGGCCGTCTCTAGACGGCACGCTCCTCAATTCGAGCCCAAGGCGACGATCGCTCAGCTGCATCAGAAGGACGACGACCAGGAGCTGGATGAGCGTGGCCGGCGGAGACTTATAGATGACTATCTCGAGGTACTACTACATATGGCGCGGGAGGGGGCGCCACCTAGCCGCCGCCTCCCGTGCGCCCCGGACGATAAACTGACAAGTGACAGTGCAGTGCAGTGCGCCCAAGCCTAA

Protein sequence:

>DPOGS206105-PA
MNERDIPSVPITTLAGVASLTDLLPVLPLPTPLPSTLNNKSQLFHPRVAEEAAILLATRDDNLVSLLLQSLMQTSVQHIDLKDESIPNAVSPAPPQATPELLKAILHVNPNVFDQQQRNHWGNHLHASKLNGMSPSSTYNYASHNTVHSSPSSHGTAVHSGSMPNQQVLATPCQSSPHVNMGSPPSQARPGSQVNITAQNNVSPPWSTSNPSNIYTSPLSNSSCSNPQNGGFYNSLVQENRNFLTNIGDDKDPLSMSPRNMTEQQKQHLQVQDKNYEQVPGQSPAQASVAPSPLRQEPQVPQPCMGAPYAPPATQSTQEKQDPSRSTQVMNTRYPVVKLGRLSEGMLKKHDVSHDERSRKNSTVDSDSDDNLPLKAKSSSSTPTVDKEREAIDMAREKNWEAVKRKHEEASRREEAEAAIVRPKLRKVERRLVPVLEMLSVDELMETNTYQRFNKLIESVFETIDDDVIITDEMEGTDIPQDMLLPRYQLQELCSEAAKLKNLGAMEAIPSDRLVRLLNILEKNIRAAEKMSLVGDPEDSEEMRQIWMESALERVMCASDACLTALYIMTSPSMPKRIFLEDVIDRIIMFIKFQLNNTIYCVYDPVYSIQSTSKKKVDGRKRRGGGAGHASRRCGSSTRAVRELYTHAHECVTLLSELFAAHHLTDTTVLHASTVGVSPFFVENISELQLSALKLVTTIFTKYEQHRRLLLEDILASIARIPSSKHNLRSFQLSSDQHIQMLTALVLQLVQCVVTLPETLCKTQDKDKDKEKEKEHVESDSKKPVDKDLTIISKYEAAISVGGTFLTSFLNKCRSRNEEVDFRPLFENFVHDLLTTVNKPEWPATELLLSLLGTMLVKYMSDKSMEMSVRVASLEYLGLVASRLRRDSVTSRAKLATMDAVVRDIRAEEEKDGCQQQSLTSGLDEDEERTEFLQRVLLDYLAINGQKDQAWNCARHFYITQWYRDMVVQPKSSSPTKRPKNKSKKKYKVESSEEESDADDDSDGGIKESKADKKLPPSALMSAEKFKTIERRKHFFLEKIRPFRYQGGTQVQVMQSYIDYSGAELISQYLASKRSFSQSFDRYLRKILVILCENTIAIRTKAMKCLTMIVEADPAVLARPDMQIGVNRSFLDQSTAVREAAVDLVGKFVLSRPELIDKYYGMLSNRILDTGVSVRKRVIKILKDICIECPEFPKIPEICVKMIRRVNDEEGIRKLVMEVFQNMWFSPCPNSSRHGALDITAATADPLTRKVLNITDVVLSSRDMGLEWFQQLLMSLFKPKEDKDDSTKIIYQPPKSLLVACQQIVDCLIEHLLQLEETNTDGAGSSQRILACLSTLHLFAKIRPQLLVNHALTLQPYLSLKCQNQYEQQIMSTVASTLELVVPLMEHPSEVFLAQLEEDAVKLILQRGQLVIAACIACLAAIVNNLTHNYKLIRDVFNKYHGVLLQWKQSWQRNAEMTRGLHTRPHFRRALFIVGLLLRYFDFTESRVIEGLATDIKEQVYSTLMFFVGLEDEDFVSHTLKALGSVCVRHYEFMLRPELKEFYHQLLTSELAPIEMKADVLRNIEMYLQEEEQRMIRQDKEWSKRSKHENLKEMGDVSSGMASTVIQLYLKEILGSFLHASTVVRSSAMKVVQLVLAQGLVHPVQIVPYLICMSTDTEVTVSHTADKNLQEIDKKYPGFIHMKAQLGIKLSYQLQKILQNNKKGVIRGFRKKEQDDLPTALNGFLYSLLRNTRPQRRALILSLLKQFDDVSTAPLDQMLYLADNLSYFPFQVQDEPLFIIHHIDIIISTSGSNLLQIFREGLLKTGSEEKEPLDEEEDEEEAEALVARLPPSTRPLRDAMRQARGCLLLLVLKQHLKQLYGFTDAKISQYSPSESVKVYEKAVSRRHAPQFEPKATIAQLHQKDDDQELDERGRRRLIDDYLEVLLHMAREGAPPSRRLPCAPDDKLTSDSAVQCAQA-
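
The lengths reported in this record can be cross-checked against each other: a 1958 aa protein plus one stop codon corresponds to (1958 + 1) x 3 = 5877 bp, which matches the transcript length. The Python sketch below illustrates that check and a small FASTA reader that joins wrapped sequence lines. The function names are hypothetical and not part of any MonarchBase tool, and the span calculation assumes the genomic coordinates are 1-based and inclusive, which the record does not state.

```python
def parse_fasta(text):
    """Parse FASTA text into {header: sequence}, joining wrapped sequence lines."""
    records = {}
    header = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue  # skip blank lines between records
        if line.startswith(">"):
            header = line[1:]
            records[header] = []
        elif header is not None:
            records[header].append(line)
    return {name: "".join(parts) for name, parts in records.items()}


def cds_matches_protein(transcript_bp, protein_aa, has_stop=True):
    """True if the transcript length equals 3 * (residues + optional stop codon)."""
    return transcript_bp == 3 * (protein_aa + (1 if has_stop else 0))


def span_length(start, end):
    """Length of an interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


if __name__ == "__main__":
    # Values copied from the record above.
    print(cds_matches_protein(5877, 1958))   # transcript vs. protein length
    print(span_length(347981, 359090))       # genomic span on DPSCF300028
```

Under that coordinate assumption the gene spans 11110 bp of scaffold DPSCF300028, roughly twice the transcript length. The trailing "-" in the protein listing appears to mark the stop codon, so it should be stripped before counting residues.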