Monarch geneset OGS2.0

DPOGS215996
TranscriptDPOGS215996-TA3078 bp
ProteinDPOGS215996-PA1025 aa
Genomic positionDPSCF300078 + 126208-133307
RNAseq coverage330x (Rank: top 35%)
Annotation
HeliconiusHMEL0040080.088.27% 
BombyxBGIBMGA000930-TA0.088.07% 
DrosophilaIswi-PC0.075.15% 
EBI UniRef50UniRef50_Q243680.075.15%Chromatin-remodeling complex ATPase chain Iswi n=96 Tax=Opisthokonta RepID=ISWI_DROME
NCBI RefSeqXP_396195.20.078.56%PREDICTED: similar to Imitation SWI CG8625-PA, isoform A isoform 1 [Apis mellifera]
NCBI nr blastpgi|3071915250.078.06%Chromatin-remodeling complex ATPase chain Iswi [Camponotus floridanus]
NCBI nr blastxgi|3072115420.078.11%Chromatin-remodeling complex ATPase chain Iswi [Harpegnathos saltator]
Group
Gene OntologyGO:00036774.7e-102DNA binding
GO:00055244.7e-102ATP binding
GO:00055154.3e-50protein binding
GO:00056346e-42nucleus
GO:00063386e-42chromatin remodeling
GO:00168186e-42hydrolase activity, acting on acid anhydrides, in phosphorus-containing anhydrides
GO:00036766e-42nucleic acid binding
GO:00043864.3e-26helicase activity
KEGG pathway 
InterPro domain[133-411] IPR0003304.7e-102SNF2-related
[855-982] IPR0090574.3e-50Homeodomain-like
[126-318] IPR0140013.8e-42DEAD-like helicase
[857-971] IPR0151956e-42SLIDE
[463-547] IPR0016504.3e-26Helicase, C-terminal
[800-849] IPR0010053.7e-06SANT domain, DNA binding
Orthology groupMCL10951 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215996-TA
ATGTCTCAAGCTGAAGAACCTATGGAAGCAGCAGATGCTGGCGATAATTCTAATGGCTCTTCAAGTGATACACCGTCATCTAGAGGCAAAGAAGGCGATTTTGAAAGCAAAATTGAAACTGATCGTGCGAAGAGATTCGATTTTTTGTTAAAACAAACGGAAATTTTCTCACATTTCATGACAAACACTCCCAAATCTGGAAGTAGTCCTCCTAAACCCAAAGCTGGAAGACCAAGAAAGATTAAAGAACCAGAGCCTGAAGCTGGAGACCACCGACATCGCAAAACAGAACAAGAGGAAGATGAAGAGCTTCTTGCCGAAACAAATACAAAACAGAAAACCATTTTCCGATTTGAATCATCACCTCCATATATAAAGAATGGTGAAATGAGAGATTACCAAGTTAGAGGCTTGAATTGGATGATATCTCTCTATGAAAATGGTATCAATGGTATCCTAGCCGATGAGATGGGTTTGGGAAAGACATTACAGACTATATCACTCTTAGGATATATGAAAAATTTCAAAAATGTTCCTGGGCCTCACATAGTTATTGTTCCAAAATCTACATTGACCAATTGGATGAATGAGTTCAAGAAGTGGTGTCCATCGCTCCGAGCTGTGTGTCTCATTGGAGATCAAGAGACCAGGAATATATTTATAAGGGAGACACTAATGCCTGGAAATTGGGATGTTTGCATCACGTCATACGAAATGATCATACGTGAGAAGTCCGTCTTCAAGAAATTTAACTGGAGGTACATGGTCATAGATGAAGCTCATCGTATCAAGAACGAAAAATCTAAACTGTCAGAACTGCTACGTGAGTTCAAGAGCATGAACAGACTGCTGTTAACTGGAACACCATTACAGAATAATCTTCATGAGCTCTGGGCTTTACTCAACTTTTTGTTGCCCGATGTGTTTAATAGTTCTGATGATTTCGATGCATGGTTCAATACAAATGCCGCACTGGGTGACAATCAGCTCGTGTCTCGATTGCATGCCGTGCTGAGACCGTTTCTATTGAGACGTCTCAAGGCCGAAGTGGAAAAGAAGCTGAAACCTAAGAAAGAACTTAAAGTGTACATAGGTTTGAGTAAAATGCAAAGGGAATGGTACACTAAAGTGTTGATGAAGGATATAGATGTTGTAAACGGCGCTGGTAAAGTAGAGAAGATGAGGTTACAAAATATTCTTATGCAGCTGCGTAAGTGTTGCAACCACCCTTACTTGTTCGATGGCGCGGAACCCGGCCCTCCGTACACCACGGATGAGCATCTAGTATACAACTGTGGAAAACTAGCGATATTGGATAAGCTGTTGCCTAAATTACAAGAACAAGAATCGAGAGTGCTCATATTCTCACAAATGACGAGAATGTTGGATATCTTGGAGGATTATTGTCTTTGGAGGCAATATAAATACTGCCGTCTAGACGGCCAAACACCTCATGAAGATAGGAACAGGCAAATCGAGGAATACAATGCTGAAGGCAGCGAGAAATTTATATTCATGTTGTCAACTCGTGCTGGAGGTCTGGGTATTAATTTGACAACAGCTGATGTGGTCATTATATATGACTCAGATTGGAACCCACAGATGGACTTGCAAGCCATGGACAGAGCTCACCGTATAGGGCAAATGAAACAAGTACGTGTATTCCGTCTTATAACTGAGAACACGGTCGAGGAGAAAATAGTTGAAAGGGCAGAGGTCAAACTGCGTCTGGACAAGCTGGTTATACAGTCAGGTCGTCTCGTTGATATTAAGAACCAACTTAATAAAGACGAAATGTTAAACATGATCAGACATGGCGCCAACCACGTCTTTTCATCTAAGGACTCTGAGATCACCGACGAAGACATTGATAGTATTCTCGCCAAGGGAGAATCTAAGACTGAGGAACTGAAACAGAAGTTGGAGAGTCTCGAAGAATCGTCGTTACGATCATTTTCCATGGACACACCCGGAGCAACTGATTCTGTATATCAATTTGAAGGTGAGGATTACAGAGAAAAACAGAAGATAGTTCCGATCGGTAATTGGATAGAACCACCTAAAAGAGAACGTAAAGCTAACTACGCGGTTGACGCGTACTTCAGAGAGGCACTTCGCGTCTCTGAACCGAAGGCGCCAAAGGTACAGGTAAATAAAGTCCCCGTTCTGGTGTTGGTTTTTAAATTTATGCATCCTATATTATATTACATTAGCTCTTTATGCTCGAATCTCTGTATACGTTGTTCGCTAACATTATATGAGAGGCAAGAGACTCCAACTTCAGACCTTGGCGCCCCCCGAATTTGTCGCCCTGGGCCATCGCCCCATCACGCCCCCCCCAACGCCGGCACTGATTATGTAACGAGTTGTAGCCGGTTAATTAATCGTAAATTTTCAGGTTTCACAAACTGGACAAAACGAGACTTCAATCAATTTATAAAGGCTAATGAAAAGTATGGAAGAGATGATATAGAAAATATCGCTAAGGATGTTGAAGGGAAGACGCCAGAAGAGGTCATGGAATACTCAGCAGTTTTTTGGGAAAGATGTCATGAACTTCAAGATATTGATAGAATTATGGGTCAAATTGAAAGAGGCGAAGCTAAGATACAGAGGAGGGCGTCGATCAAAAAAGCTTTAGATGCCAAAATGGCGCGATATAGGGCGCCCTTCCACCAACTTAGAATATCATATGGAACAAATAAGGGGAAAAATTATGTAGAAGAAGAGGACAGATTCCTCGTTTGCATGCTGCATAAATTGGGTTTCGATAAGGAAAATGTCTATGAAGAACTCAGAGCCGCTGTACACGCGGCTCCTCAATTCCGTTTCGATTGGTTCCTTAAATCACGTACAGCCGTTGAATTACAACGAAGATGCAATACTCTTATAACATTGATCGAGAGAGAAAATCAAGAATTGGAAGAAAAAGAACGCGCTGAAAAGAAAAAGAAGAGTGGTAACTCTAATCAAAATACCCCAGGAGGTAATGCTACGGGTAAAGGTGCCAATTCAGGCAAACGTAAGGCTGAGAACACACCTGACACGACTCAGAAAAATAAGAAAAAGAAAAAATGA

Protein sequence:

>DPOGS215996-PA
MSQAEEPMEAADAGDNSNGSSSDTPSSRGKEGDFESKIETDRAKRFDFLLKQTEIFSHFMTNTPKSGSSPPKPKAGRPRKIKEPEPEAGDHRHRKTEQEEDEELLAETNTKQKTIFRFESSPPYIKNGEMRDYQVRGLNWMISLYENGINGILADEMGLGKTLQTISLLGYMKNFKNVPGPHIVIVPKSTLTNWMNEFKKWCPSLRAVCLIGDQETRNIFIRETLMPGNWDVCITSYEMIIREKSVFKKFNWRYMVIDEAHRIKNEKSKLSELLREFKSMNRLLLTGTPLQNNLHELWALLNFLLPDVFNSSDDFDAWFNTNAALGDNQLVSRLHAVLRPFLLRRLKAEVEKKLKPKKELKVYIGLSKMQREWYTKVLMKDIDVVNGAGKVEKMRLQNILMQLRKCCNHPYLFDGAEPGPPYTTDEHLVYNCGKLAILDKLLPKLQEQESRVLIFSQMTRMLDILEDYCLWRQYKYCRLDGQTPHEDRNRQIEEYNAEGSEKFIFMLSTRAGGLGINLTTADVVIIYDSDWNPQMDLQAMDRAHRIGQMKQVRVFRLITENTVEEKIVERAEVKLRLDKLVIQSGRLVDIKNQLNKDEMLNMIRHGANHVFSSKDSEITDEDIDSILAKGESKTEELKQKLESLEESSLRSFSMDTPGATDSVYQFEGEDYREKQKIVPIGNWIEPPKRERKANYAVDAYFREALRVSEPKAPKVQVNKVPVLVLVFKFMHPILYYISSLCSNLCIRCSLTLYERQETPTSDLGAPRICRPGPSPHHAPPNAGTDYVTSCSRLINRKFSGFTNWTKRDFNQFIKANEKYGRDDIENIAKDVEGKTPEEVMEYSAVFWERCHELQDIDRIMGQIERGEAKIQRRASIKKALDAKMARYRAPFHQLRISYGTNKGKNYVEEEDRFLVCMLHKLGFDKENVYEELRAAVHAAPQFRFDWFLKSRTAVELQRRCNTLITLIERENQELEEKERAEKKKKSGNSNQNTPGGNATGKGANSGKRKAENTPDTTQKNKKKKK-