Monarch geneset OGS2.0

DPOGS216111
Transcript: DPOGS216111-TA (1932 bp)
Protein: DPOGS216111-PA (643 aa)
Genomic position: DPSCF300182 - 12786-18880
RNAseq coverage: 719x (Rank: top 18%)
Annotation:
  Heliconius | HMEL010720 | 1e-180 | 70.15%
  Bombyx | BGIBMGA009228-TA | 2e-165 | 68.07%
  Drosophila | dalao-PA | 2e-91 | 58.36%
  EBI UniRef50 | UniRef50_E2C441 | 6e-104 | 67.21% | SWI/SNF-related matrix-associated actin-dependent regulator chromatin subfamily E member 1 n=7 Tax=Coelomata RepID=E2C441_HARSA
  NCBI RefSeq | XP_395543.3 | 6e-105 | 68.11% | PREDICTED: similar to dalao CG7055-PA [Apis mellifera]
  NCBI nr blastp | gi|328780490 | 2e-104 | 67.53% | PREDICTED: hypothetical protein LOC412077 [Apis mellifera]
  NCBI nr blastx | gi|350413525 | 1e-120 | 49.70% | PREDICTED: hypothetical protein LOC100744180 [Bombus impatiens]
Group:
Gene Ontology:
  GO:0005515 | 3.2e-21 | protein binding
  GO:0003677 | 2.7e-18 | DNA binding
KEGG pathway:
InterPro domain:
  [113-205] | IPR009071 | 3.2e-21 | High mobility group, superfamily
  [128-208] | IPR000910 | 2.7e-18 | High mobility group, HMG1/HMG2
Orthology group: MCL13870 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS216111-TA
ATGGCTACAACAAACAGCTACAAACAAAACACATCAATGTCGATGCCAAGCCCTCAGAACAATACGCAATACATGATTGCGGGGCCCCCAATGAGTTTCGGTATGATGAAAGGCGGTATGAATGCACCGCCGCACCATAACCACGTGTACCACCCTCAATACCAGGGTGGGGGGGGCCTGGGTCCCCCCGGGGTTCATGGGGGGTACGGTTGGCAACACTCGCCGCGTTTCGCTGCAGAGAGCGCTCGCAGGGCTGCCGGTGCGGCCTCGCAGCCCGGGAAAGATGACAAGACGAGCCCATTCGTGTCAACTATACATTCACATCCCGGATTCCAACCACAGAAGATTGGCAAAGGAACTGGTGGGGGTGCTGGACTGCCAAAACCACCGAAGCCACCAGAAAAGCCGCTTATGCCATACATGAGATATTCAAGAAGGGTTTGGGACAGTGTGAAGGCAGCAAACCCAGATCTAAAGCTATGGGAAATCGGCAGAATTATCGGTGGTATGTGGAGAGATCTGCCCCAGTCAGAAAAGTATGCTTTTGTGGATGAATACGAAGCTGAGAAGGCGCAGTACACAGAAATGTTGAAAGCATACCAATCATCGCCCGCATACTTACAGTGGCTGGCTCACAAAAACAAAGGTGATCTTTCGGAATACGAGAAGAGTTTGAAGACATACCACAATTCCCCGGCGTACCTCGCATACATCGCCGCCAAGAACAAGGCTGTTGTTGGGAATCTGGAAGAAGAGAGTTCAAGCAAGAAGGGAGGTTCACAGAAGGAGTCACAGCAGCAAGACAGAAGAATTGACATCCAACCAGCTGAAGATGAAGACGATCAGGACGAGGGTCTGTCAGTGAAGCACGTGGCTTACGCCCGCTATCTTCGTAATCACCGTCTCATCAACGAGATTTTCTCCGACACGGTGGTGCCAGATGTAAGGTCGGTGGTCACCACCGCTAGGATGCAGATTTTAAAGAAGCAGGTACAATCGCTGACGATGCATCAGAAGAAATTGGAAGACGAACTTCAACAAATTGAAGAGAAGTTTGAAGCAAAGAAAAGGAAGTTCATAGAGAGCAGCGAGTCCTTCCAGGAGGAGTTGAAGAAGCATTGTAAGCCAGCGGTGGACGACGAGACGTTCGCTAGAATGGTGGAGCGAGCGGTCGATCAGCAGAGAAGGGGTGCTGCTGGACCACACCCACACCCGAATCACCCACACCCCTCGCACCCTCACGCGACTCACCCTACGCATCCTTCACACCCGACACATCCCCACCACACGGCACCACAACACGCTCAGCCGGCACACCCTCGGCCTCATGAGCCAATGGATCAAGATCCGCCCAAACCAGAGGCGAACGGCCCCAACGCCCCCACTCAAGTCGACAAAGTGTCTACCACGGAGGGGAAACAGGACACAAACATCACCACGGAGTTGAAGGCTGATGAGAAACAGATCCAGGAAGGAGAAGGTGATCGTAAAGAAGATAAGCCGATTGAAATGAAAGTCGAGAGTAAGGAGGGGCCTCCGCCGGCCGGGCCGCCACAGCCACAGATCACATCGCCCCCACACCCAGCTATGATGTTACCTCCGGGAGCTAGCGGTCCGCCACCCGGAGCCAACCACGCCCCTCCACCTCCCGGTGGTTATGGCTCAGCGCCGTACGGTGGACGTTACTATCCAGGCTACGGTGTGGGAGGAGTAGGTCCCGTGGGTCCTGTGGGCGGTGTCGGACCTGTGGGCGCTGTTGGACCCGTGGGCGCTGTCGGACCCGTGGGCGGTGTGGGCCCAGTGGGTGGCGTCGGCGGTTTCCCTCCATACTCGCACTACTTCCCAGCGGAGCACTATTCACACCACCCACAGCACCCCGCTCATCATGACGTTAAGCCTGAGGAACCGCCGGCCAAGAAGGAGTCCGAATGA

Protein sequence:

>DPOGS216111-PA
MATTNSYKQNTSMSMPSPQNNTQYMIAGPPMSFGMMKGGMNAPPHHNHVYHPQYQGGGGLGPPGVHGGYGWQHSPRFAAESARRAAGAASQPGKDDKTSPFVSTIHSHPGFQPQKIGKGTGGGAGLPKPPKPPEKPLMPYMRYSRRVWDSVKAANPDLKLWEIGRIIGGMWRDLPQSEKYAFVDEYEAEKAQYTEMLKAYQSSPAYLQWLAHKNKGDLSEYEKSLKTYHNSPAYLAYIAAKNKAVVGNLEEESSSKKGGSQKESQQQDRRIDIQPAEDEDDQDEGLSVKHVAYARYLRNHRLINEIFSDTVVPDVRSVVTTARMQILKKQVQSLTMHQKKLEDELQQIEEKFEAKKRKFIESSESFQEELKKHCKPAVDDETFARMVERAVDQQRRGAAGPHPHPNHPHPSHPHATHPTHPSHPTHPHHTAPQHAQPAHPRPHEPMDQDPPKPEANGPNAPTQVDKVSTTEGKQDTNITTELKADEKQIQEGEGDRKEDKPIEMKVESKEGPPPAGPPQPQITSPPHPAMMLPPGASGPPPGANHAPPPPGGYGSAPYGGRYYPGYGVGGVGPVGPVGGVGPVGAVGPVGAVGPVGGVGPVGGVGGFPPYSHYFPAEHYSHHPQHPAHHDVKPEEPPAKKESE-
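
Consistency check (illustrative sketch, not part of the original record):

The lengths listed above fit together: a 643 aa protein plus one stop codon needs (643 + 1) x 3 = 1932 nucleotides, which is the stated transcript length. The Python sketch below shows one way to check this and to translate a coding sequence. It assumes the standard genetic code, which this record does not state explicitly, and it writes the stop codon as "-" to match the protein sequence above. The names CODON_TABLE and translate are hypothetical helpers introduced only for this example.

```python
# Minimal sketch: translate a coding sequence with the standard genetic code.
# Assumption: the record uses the standard code; it does not say so itself.
# The stop codon is rendered as "-" to match the protein sequence in this record.

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; "*" marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(cds: str, stop_symbol: str = "-") -> str:
    """Translate a coding sequence codon by codon, starting at position 0.

    Raises ValueError if the length is not a multiple of three.
    Stop codons are included in the output as `stop_symbol`.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if amino_acid == "*" else amino_acid)
    return "".join(residues)


# Length check using the figures stated in the record.
transcript_length_bp = 1932
protein_length_aa = 643
assert transcript_length_bp == 3 * (protein_length_aa + 1)

# First nine codons and last four codons of DPOGS216111-TA, copied from above.
first_codons = "ATGGCTACAACAAACAGCTACAAACAA"
last_codons = "GAGTCCGAATGA"
print(translate(first_codons))
print(translate(last_codons))
```

Passing the full DPOGS216111-TA sequence to translate should give a string that can be compared directly with the DPOGS216111-PA entry, including the trailing "-".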