Monarch geneset OGS2.0

DPOGS212366
Transcript        DPOGS212366-TA    3438 bp
Protein           DPOGS212366-PA    1145 aa
Genomic position  DPSCF300019 + 104990-111660
RNAseq coverage   545x (Rank: top 23%)
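
The lengths listed above can be cross-checked with simple arithmetic. The sketch below is only an illustration: the variable names are made up for this example, and it assumes that the genomic coordinates are 1-based and inclusive, which the record does not state.

```python
# Values copied from the record; variable names are illustrative only.
transcript_bp = 3438
protein_aa = 1145
genomic_start, genomic_end = 104990, 111660

# A coding sequence should be a whole number of codons.
assert transcript_bp % 3 == 0
codons = transcript_bp // 3

# Codons that are not accounted for by amino acids (expected: the stop codon).
extra_codons = codons - protein_aa

# Assumption: coordinates are 1-based and inclusive.
genomic_span = genomic_end - genomic_start + 1

# Genomic bases not present in the transcript (e.g. intronic sequence).
non_transcript_bp = genomic_span - transcript_bp

print(codons, extra_codons, genomic_span, non_transcript_bp)
```

Under these assumptions the transcript length corresponds to 1145 amino acids plus one terminal stop codon, and the genomic interval is longer than the transcript, which would be consistent with a gene model that contains introns.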
Annotation
Heliconius      HMEL005311        0.0  78.36%
Bombyx          BGIBMGA004658-TA  0.0  82.57%
Drosophila      SA-PA             0.0  53.90%
EBI UniRef50    UniRef50_E2AT91   0.0  55.62%  Cohesin subunit SA-1 n=2 Tax=Eukaryota RepID=E2AT91_CAMFO
NCBI RefSeq     XP_966898.1       0.0  58.21%  PREDICTED: similar to stromal antigen [Tribolium castaneum]
NCBI nr blastp  gi|91083057       0.0  58.21%  PREDICTED: similar to stromal antigen [Tribolium castaneum]
NCBI nr blastx  gi|383850038      0.0  58.77%  PREDICTED: cohesin subunit SA-1-like [Megachile rotundata]
Group
Gene Ontology    GO:0005488  3.8e-13  binding
KEGG pathway     tca:655276  0.0
                 K06671 (STAG1_2, SCC3, IRR1) maps-> Meiosis - yeast; Cell cycle - yeast; Cell cycle
InterPro domain  [217-336] IPR013721  2e-43    STAG
                 [153-868] IPR016024  3.8e-13  Armadillo-type fold
Orthology group  MCL10539  Multiple-copy universal gene

Nucleotide sequence:

>DPOGS212366-TA
ATGCACCGAAGAGGCGGGAAACGCATCCGAATGGATGACCCTCCCCCGGAGTATGTCAATCCTATGACACCGGCTACACCTATGACCGATTACGGTGGACAATCTGTACACGAGCCGGAAACCCCCAGCATTAATTATGCTGGCTTTAACACTGGAACCGTGGCAAACTCAACAGTAGGAGAGTCCAACAGAGAAGATATCGAAGAACATCAAGAGGAAGAAACGAGGTCACCGTCGCCAGCACCGACGAAAAGAGTAACTCGTAGCAGAGGACGAGGTGGGGATGGAGGATATGTGGGAAGACTGGCTGAGTCACCACCTCCGCCACCGCTCCGAAGAAGAGGACGAGGTGGTCGCGGCCGGGGACGAGGCAGGGGAGCCCCCCCACCACAGGCGTACTCTCCGCCTCCAGTATTGCTGCCAGGAGATGATGAAAACAGTCTTTACAATATCCTCCGGTTTAATAAAACAGCTATAAATCAAGTGGTAGACATGTGGATAGAGGAATACAAGAGTAACCGTGAGAGTGCACTGGTTCAGCTGATGCAGTTCTTTATTAATTCCTCTGGATGTCGGGGAAAGGTCACACCCAACATGGCTCAGATGGACCATACCCTCATTATCAAGAAGATGACCCAGGAGTTTGATGAGGAAAGTGGTGAATATCCATTAATAATGTCAGGACACACATGGAAAAAGTTCCGATCTAACTTCTGTGAGTTCATCCAAACTCTGGTGAAGATGTGTCAGTACTCCATCATATACGACCAGTTCCTGATGGACAACATCATATCCTTGCTCACGGGGCTGTCTGACTCTCAAGTGAGAGCATTCAGACACACCGCTACACTCGCTGTGATGAAGCTGATGACGGCGCTGGTGGACGTGGCACTGCTCACGTCTGTCAACTGCGACAACTGTCTGAGGCAGTACGAGGCCGAGCGGCTCAAGGCGCGCGACAAGAGAGCCAGCGAGCGGCTGGAGGTGCTGGTCGCCAAGAGACAGGAGCTGGAGGAGAACATGGAGGAGATCAAGAACATGCTTTCCTACATGTTCAAGTCCGTGTTCGTGCATCGCTACAGAGACACCTTAGCCGAGATCAGAGCGATCACCATGTCCGAGATCGGGATCTGGATGGAGAAATTTCCTGCTCATTTCCTGGACGATTTGTATCTGAAGTACATCGGTTGGACTCTCCACGACAAGGTGGGCGAGGTCCGTCTCCGCTGCCTGCAGGCGCTGCAGCCGTTATACGAGTGCGAGGAGCTGAAGAGCAAATTAGAGCTGTTCACGTCCAAGTTCAAGGACCGCATCGTGTCAATGGCCCTCGACAAGGAGACCGAGGTGGCCGTCCACGCCGTGAGACTCGTCATCGCCATACTCAAGATGCATCCCGACGTCCTGACCGACAAGGACTGCGAGAACGTTTATGAACTGGTGTACTCGTCGTGGCGCAGCGTGGCTGCGGCGGCGGGGGAGTTTCTGAACGTCCGCTTGTTCCGCCCTGATGACCCGGGCGCTCCGCCTGCGCGCTCGCGGCGCGGCAAGCAACGTCTACCCAACACGCCGCTGGTGCGCGACCTCGTGCAGTTCTTCATCGAGTCGGAGCTGCACGAGCACGGCGCCTACCTCGTGGACTCCCTCATAGAGTCCAACCCCATGATGAAGGACTGGGAGTGTATGACGGACCTGCTGCTCGAGGAGCCCGGGCCCACCGAGGAGCCGCTCGACAACAGACAGGAATCGTCCCTGATCGAGCTGATGGTGTGCTGCGTGCGCCAGGCCAGTACGGGCGAGCCGCCGGTGGGTCGGGGCGCGTCCCGCAAGCAGCACCAGGCGCTGTCCAAGGACCAGGCCAAGGCGGCCAACGACGACCGCGTCAAGATGACAGCACACTTCATGGTGGCGCTGCCGGCGCTGCTCGACAAATTCTGCGCCGACCCCGAGAAGCTCAATAACCTCGTCACCATCCCGCAGTACTTCGACCTCGAGCTCTACAC
CACGCAACGGCAGGAAGGAAATCTGACGCTGCTGTTGAACAAGATCCGGGAGATCGTCAGCACTCACACCGAGGCCGAAGTGCTGGAGACGTGCGGCCGGACGCTGGAGTACCTGTGCAGCGAGGAGCATGCAGTCTACACGCGCTGCAACGTGGCGCGCGCCACGCTCACCGACATGTGCGTCAACAGATACAAGGAGGCCATCGACGACTACCGGAACCTCATCGAGGGGGGGGAGACTCCGGACGCCGACGAGGTGTTCAACGTGATCAACTCGCTGCGCAAGGTGTCCATCATGTACATGTGCCACAACCTGAACGACACCAACATCTGGGACTCGCTGTTCGAGGACCTGCCTAAGTGCGTATCGCCGGGGCTGATGCCGACGCAGGCGCTGGTGTACGTGGTGCGCGCCTGCTTCTACTCCGTGCTGTGGTCGCTGCACGAGCTGGACGAGCGCGGCGGGGACCCCGCGCCCCTGAGGGAGCGACTGTTGGCCTACGCCGCTCACTGTCGCAATATCGTCGCGGCCGGCGCGACACCCGACCTCAAGGAGGAGGCCTACACGAGTCTTTGTGACCTGCTGATCTTCTTCGCGGAGTGTCCGCGCGGCGGCTCGTCGGCCCCGGGCGCCGGTCTGCGCGCTCTGGAGGCGGACAGCGCCACTATGGACCTGCTCAACGCCTTCGTGCAAGAATTCGTGTTCGTCCAAAACAACTACGACGGACAAGACGAGAGACGGATAGAGGAGCTCCACAAAAGAAGGAACTTCTTGGCCGCCTACTGCAAGCTCATCGTGTACAACGTGGCACCGCTGAGGCGCGCCGCAGAGGTCTTCAAACACTACATACGGTGCTACAACGACTACGGAGACATCATCAAGGCCACGCTGAGTAAAGCTCGGGAGATCAACAAGCTGGGCTGCGCGCTCACCATGCAGCTCGCCATGCAGATGCTGTTCACGGACGTGCTGCGGCTCCACCCGCGACCCTCGCGACAACTCACCGAGTTCCTGGAGGTCAAGGAGCTCGCCAAGCGGTTCGCGGTCATGTTCGGGCTGGACGCCGTCAAGAACCGCGAGGCCCTCACGGCGCTACACCGCGCCGGCGTCGCCTTCGCCGCTCTCGAGGGCCCAGGCCCCGGCCCGCCGCCCAATCTTCTGTTCCTAGAGCCACTGGCCGAGTTCTCCGCCAAACTGCTCCGTCAGGACAAGCGTGCCGTGCTCAAGTTCGCAGAGACCAAGTTCTCGAGCATGCAGTGGGGCGAAGAGTGGGCGCCGCTGCTCGCCTACAGGAACTCGCTGCTCACGGACGCCCCGGACGAGAGGCCGCCGCCCGCCAGGAGACACTACACGAGACGCACGCGTGGTGGAGGAGGTGGCGCGGCGCCCGAGTCTGACGATGCCGACGACCCGCACTACTCAGACCCCGAGATATGA

Protein sequence:

>DPOGS212366-PA
MHRRGGKRIRMDDPPPEYVNPMTPATPMTDYGGQSVHEPETPSINYAGFNTGTVANSTVGESNREDIEEHQEEETRSPSPAPTKRVTRSRGRGGDGGYVGRLAESPPPPPLRRRGRGGRGRGRGRGAPPPQAYSPPPVLLPGDDENSLYNILRFNKTAINQVVDMWIEEYKSNRESALVQLMQFFINSSGCRGKVTPNMAQMDHTLIIKKMTQEFDEESGEYPLIMSGHTWKKFRSNFCEFIQTLVKMCQYSIIYDQFLMDNIISLLTGLSDSQVRAFRHTATLAVMKLMTALVDVALLTSVNCDNCLRQYEAERLKARDKRASERLEVLVAKRQELEENMEEIKNMLSYMFKSVFVHRYRDTLAEIRAITMSEIGIWMEKFPAHFLDDLYLKYIGWTLHDKVGEVRLRCLQALQPLYECEELKSKLELFTSKFKDRIVSMALDKETEVAVHAVRLVIAILKMHPDVLTDKDCENVYELVYSSWRSVAAAAGEFLNVRLFRPDDPGAPPARSRRGKQRLPNTPLVRDLVQFFIESELHEHGAYLVDSLIESNPMMKDWECMTDLLLEEPGPTEEPLDNRQESSLIELMVCCVRQASTGEPPVGRGASRKQHQALSKDQAKAANDDRVKMTAHFMVALPALLDKFCADPEKLNNLVTIPQYFDLELYTTQRQEGNLTLLLNKIREIVSTHTEAEVLETCGRTLEYLCSEEHAVYTRCNVARATLTDMCVNRYKEAIDDYRNLIEGGETPDADEVFNVINSLRKVSIMYMCHNLNDTNIWDSLFEDLPKCVSPGLMPTQALVYVVRACFYSVLWSLHELDERGGDPAPLRERLLAYAAHCRNIVAAGATPDLKEEAYTSLCDLLIFFAECPRGGSSAPGAGLRALEADSATMDLLNAFVQEFVFVQNNYDGQDERRIEELHKRRNFLAAYCKLIVYNVAPLRRAAEVFKHYIRCYNDYGDIIKATLSKAREINKLGCALTMQLAMQMLFTDVLRLHPRPSRQLTEFLEVKELAKRFAVMFGLDAVKNREALTALHRAGVAFAALEGPGPGPPPNLLFLEPLAEFSAKLLRQDKRAVLKFAETKFSSMQWGEEWAPLLAYRNSLLTDAPDERPPPARRHYTRRTRGGGGGAAPESDDADDPHYSDPEI-
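
The nucleotide and protein records can be compared by translating the transcript with the standard genetic code. The following is a minimal sketch, not the pipeline used to produce this geneset: the `translate` helper and the `head`/`tail` variables are hypothetical names, and only the first 30 and last 12 bases of DPOGS212366-TA are used to keep the example short. It writes the stop codon as `*`, whereas the protein record above ends with `-`.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons
# enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0; unknown codons become 'X'."""
    cds = "".join(cds.split()).upper()   # drop line breaks and whitespace
    usable = len(cds) - len(cds) % 3     # ignore a trailing partial codon
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3)
    )


# First 30 and last 12 bases of the transcript, copied from the record.
head = "ATGCACCGAAGAGGCGGGAAACGCATCCGA"
tail = "CCCGAGATATGA"

print(translate(head))  # start of the protein
print(translate(tail))  # end of the protein, including the stop codon
```

Passing the full nucleotide sequence to `translate` and comparing the result with the protein record (after replacing the trailing `-` with `*`) would extend the same check to the whole gene model.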