Monarch geneset OGS2.0

DPOGS205464
TranscriptDPOGS205464-TA1554 bp
ProteinDPOGS205464-PA517 aa
Genomic positionDPSCF300166 - 105720-107616
RNAseq coverage500x (Rank: top 25%)
Annotation
HeliconiusHMEL0129960.074.37% 
BombyxBGIBMGA008274-TA0.069.57% 
Drosophilabbx-PB2e-4830.98% 
EBI UniRef50UniRef50_D2A3L17e-6135.71%Putative uncharacterized protein GLEAN_07499 n=1 Tax=Tribolium castaneum RepID=D2A3L1_TRICA
NCBI RefSeqXP_968051.11e-6135.71%PREDICTED: similar to bobby sox CG1414-PC [Tribolium castaneum]
NCBI nr blastpgi|910807713e-6035.71%PREDICTED: similar to bobby sox CG1414-PC [Tribolium castaneum]
NCBI nr blastxgi|3287826352e-6434.45%PREDICTED: hypothetical protein LOC551787 [Apis mellifera]
Group
Gene OntologyGO:00055153.2e-17protein binding
GO:00036771.6e-15DNA binding
KEGG pathway 
InterPro domain[1-66] IPR0090713.2e-17High mobility group, superfamily
[1-66] IPR0009101.6e-15High mobility group, HMG1/HMG2
Orthology groupMCL17391 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205464-TA
ATGAACGCCTTTCTCATATTCTGCAAACGTCACCGTAGCGTCGTCAGAGACAAGTATCCAAATTTAGAAAACAGATCTATAACTAAAATTCTCGGAGAATGGTGGGCCAATTTGGACCAGGAAGAGAAAGCTTGTTATACAGGTCTTGCTAAACAGTACAAAGACGCGTTCTTCAACGCCAACCCTGACTTCAAATGGTACAAGTTGCCCGCACCACCGCTACGTACACTATCCTCCCGGCCTCGGGAGTCCACTGAGAAAGCAGAATCCTCCAACGAATATAACGACCATGAACTGGAAAAAACAAATAATAATTCCACAAAATTCGACTACAATAAGAAACCAGACAGTGATCCAGAAACTGAAGCTAAACCACTATCCATGTTCACGCCTGGAAAGTTAGCCGATGAAGCTCAAATGGGAGGTCTGAGTAGCCTCCTAGCTACCAAGACTGAAGTCCAAACTCCTAATCCATATTATTCTCCTCCTTCCTGTAAATTTAACGCGATCTCAACTCCAATCGAATACAGATCTCATGATTCAATAGACAGACCCAAAACCAAACGAGAAGACAGCATACGTGAGCTTCAAAATGCACTTACAGAAACAACTCGAATGTTCGAAGAAGATTTCGATGAAAAGGAACAATTACGTTATTACGGTGCAGCTAACGACCAATTCACGAACCAAGATGTTATCGACCAAATAGTTGATAAACGCTATTCTAAAGATGATGAAGGATACCAGAGAAACTGGTCGGATGATGAGAAAAATTCCAAGTCTGGTAGAACTTGCAAAGGGAAAAGATATCAAGAGTTCATGGCTGTTGGAGGACTGATAGTGAATAAGAGGCCGAGGAGAGATTATCCCGATAGATTGTCGGACGAAGGCTACAATGCATCTTGTAGCTGGGATCCTGGATCTTCGCTCGAGGAATCAACAATGACAATGGCAGACGAGTCGACGCCCGACACTAACTATATACAGCACGACATAACAGTCGAAAGCGAACCAAATGTGGAACCAAACGACACGCCCGAAATAGACAATAACTGTAACAAATCATTTAAGGCTGCCGACTTCGATTTAGAAGCTAAAATAAGAGCTCTACCCTCCCTTAGTTTAGAGAAGTTTCAACAGAAAAAACGCGAGAATAAACGTAAGAAAAAAAACGTTAGCCTGAGAACTAAATCCGTCAAGTCATCGCAAATAATAAACTCGGTTCCGCGTCCGGTTATGGACGAGCGTCATGAAATGGCGGAGAATTGGCGCGAGACCGTCATAGGGAGCCAGAAACGGAAACCGAGGAAAATAAGCATCACACGACTCGAAATCAACAGCCTCGTCTCCAGTAACATGAACGGCGGCAATAAAATCAGTCCAGAAATAAAAATTGCCACAGAAGCTCCTTGCACCATCCAAAGCATGGACATATGCAATCAGAGCCATGGTAATGTCGACCTGTTCGCGTTGGCCACGTTGGCCGAGGTCGCTGCCAACACGTCCAAAATAGAGCAGACCAATGCGGCAAGCGAAGACGCTTCCAAGGTATGA

Protein sequence:

>DPOGS205464-PA
MNAFLIFCKRHRSVVRDKYPNLENRSITKILGEWWANLDQEEKACYTGLAKQYKDAFFNANPDFKWYKLPAPPLRTLSSRPRESTEKAESSNEYNDHELEKTNNNSTKFDYNKKPDSDPETEAKPLSMFTPGKLADEAQMGGLSSLLATKTEVQTPNPYYSPPSCKFNAISTPIEYRSHDSIDRPKTKREDSIRELQNALTETTRMFEEDFDEKEQLRYYGAANDQFTNQDVIDQIVDKRYSKDDEGYQRNWSDDEKNSKSGRTCKGKRYQEFMAVGGLIVNKRPRRDYPDRLSDEGYNASCSWDPGSSLEESTMTMADESTPDTNYIQHDITVESEPNVEPNDTPEIDNNCNKSFKAADFDLEAKIRALPSLSLEKFQQKKRENKRKKKNVSLRTKSVKSSQIINSVPRPVMDERHEMAENWRETVIGSQKRKPRKISITRLEINSLVSSNMNGGNKISPEIKIATEAPCTIQSMDICNQSHGNVDLFALATLAEVAANTSKIEQTNAASEDASKV-