Monarch geneset OGS2.0

DPOGS214365
TranscriptDPOGS214365-TA1065 bp
ProteinDPOGS214365-PA354 aa
Genomic positionDPSCF300020 + 720771-723997
RNAseq coverage371x (Rank: top 32%)
Annotation
HeliconiusHMEL0063560.095.47% 
BombyxBGIBMGA003979-TA2e-13370.45% 
DrosophilaCG4203-PA2e-11859.03% 
EBI UniRef50UniRef50_B4NKT15e-11858.36%MAU2 chromatid cohesion factor homolog n=5 Tax=Drosophila RepID=SCC4_DROWI
NCBI RefSeqXP_002428551.11e-15182.85%conserved hypothetical protein [Pediculus humanus corporis]
NCBI nr blastpgi|3838622479e-15174.15%PREDICTED: MAU2 chromatid cohesion factor homolog [Megachile rotundata]
NCBI nr blastxgi|3838622472e-14473.86%PREDICTED: MAU2 chromatid cohesion factor homolog [Megachile rotundata]
Group
Gene OntologyGO:00054884.7e-09binding
KEGG pathway 
InterPro domain[7-316] IPR0194402.8e-67Cohesin loading factor
[56-245] IPR0119904.7e-09Tetratricopeptide-like helical
Orthology groupMCL13444 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214365-TA
ATGGCGTCAACGCAGGACGCTTGGTATATTTCTTTACTTGGTTTAGCGGAACATTTCCGTACATCAAATCCTCCTGACATAAAAAGTTGCATTCAGTGTCTTCAAGCTGTATTTAATTTCAAACCGCCACAAAGAGTCGAGGCCAGAACACATTTACAACTTGGAAATATCCTTTTGACACACACTAAAAACATCGACTTGGCGAGAACGCATTTGGAACAGAGTTGGTGCTTATCTCAGACTATTACTGGATTTGATGATGTTAAGTTTGAGGCAGCGAGTGTGCTCGCGGAGTTATTCGAGCAGCAGGGCCAACCAACTCATTCGAAACCTATATTACGAAAAGCTATTGAATTATCACAGCACAGCGTTTACTGGCACTGCAGACTAATATTCCAGTTGGCACAAATTCATGCAACAGAGAGAGAATATGAAGTAGCTAGTAGTTTGCTTGGTGTTGGTGTGGACTATGCACAGATTTCCAATGCAGCATATACTAGAGTACTATTTCTACTCAGTAGGGTTATGTTACTATTAATAGACAAGAAAATCCAGGAAGTATTACCGTTATTGAACCAGGCCGGTCATCTTGTTGAGACATGGGCCGGTAGTCCTCACCAGAAAGAATATCTTAAAGTATTTTTCCTTGTGCTGCAGGTGTGTCATTATTTGATGGCTGGTCAAGTGAAGAGTGTGAAACCATGTCTGAAACAATTACAGCAGAGTATTCAGACTATCATGGCTCCGACCTGGCCCGACGATGATGCGGTGTGCGGGAGTGCCTCGGGGGAGTCATTTGTGTGGTTGTCAAGACAGCAACTGTATGTGTTGGTGTACCTCGTGACAGTCATGCATTCAGCTCAGGCTGGTTATATGGACAAAGCTCACAAGTACACGGAGAAGGCTCTCGCCCAAATAGACAAGTTGACCTCGAGTGAGGAGGCGAGCGAGGGGAGCGGGGCGGGCGCGGGCGGCTCGGGCTCGGTGCGCGGCTGTGCGGCGCTCGCCTGGAGGCTGCGGATGGCGCTGCTGGAGCACGCCGCCCTGTGCCGGCTCAATAGATAG

Protein sequence:

>DPOGS214365-PA
MASTQDAWYISLLGLAEHFRTSNPPDIKSCIQCLQAVFNFKPPQRVEARTHLQLGNILLTHTKNIDLARTHLEQSWCLSQTITGFDDVKFEAASVLAELFEQQGQPTHSKPILRKAIELSQHSVYWHCRLIFQLAQIHATEREYEVASSLLGVGVDYAQISNAAYTRVLFLLSRVMLLLIDKKIQEVLPLLNQAGHLVETWAGSPHQKEYLKVFFLVLQVCHYLMAGQVKSVKPCLKQLQQSIQTIMAPTWPDDDAVCGSASGESFVWLSRQQLYVLVYLVTVMHSAQAGYMDKAHKYTEKALAQIDKLTSSEEASEGSGAGAGGSGSVRGCAALAWRLRMALLEHAALCRLNR-