Monarch geneset OGS2.0

DPOGS210851
TranscriptDPOGS210851-TA1980 bp
ProteinDPOGS210851-PA659 aa
Genomic positionDPSCF300027 + 519496-523732
RNAseq coverage290x (Rank: top 38%)
Annotation
HeliconiusHMEL0085202e-12359.15% 
BombyxBGIBMGA006979-TA5e-17250.77% 
DrosophilaCG4751-PA3e-0627.07% 
EBI UniRef50UniRef50_D6X3848e-3734.40%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6X384_TRICA
NCBI RefSeqXP_972498.12e-3734.40%PREDICTED: similar to myb-like, SWIRM and MPN domains 1 [Tribolium castaneum]
NCBI nr blastpgi|910905843e-3634.40%PREDICTED: similar to myb-like, SWIRM and MPN domains 1 [Tribolium castaneum]
NCBI nr blastxgi|910905842e-3334.40%PREDICTED: similar to myb-like, SWIRM and MPN domains 1 [Tribolium castaneum]
Group
Gene OntologyGO:00055158.8e-06protein binding
KEGG pathway 
InterPro domain[432-527] IPR0005558.8e-06Mov34/MPN/PAD-1
Orthology groupMCL25294 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210851-TA
ATGGCCGACGACGACGAGATTGACATTCTTGGTGATTTTTCATTTAATTCTTGTTTTGCCCAAAATAATCAGGGAATTCCTTCTTGCTCCAACAGAGAAGACACCGTGCACCCTCAATGGCTTCTGGATTCCCCTCCAACAAATTGGTATGATACACAGAATAAAGATAAAAGTTATAGGCCCAAAGATGGACCATCAAGGAAGCTATCAGGAACAACAGCAAATTATCAGCATACAACGGTCCATACATCCTGGACTCAGAAGGAAAGAGATTTGCTGGCACAAGAAATGGCCAGGTATGGGAGAAATGTGAACAAAATATCCAAAGCACTAAAAACAAAAAGTGAATTAGAAATTCAAGCTCTCATAGAAGCAGAGCACGGCATTCTATTGGAGACGGAAAATATTAAAACACCCGCAGTGAAACCTGACAACATACCCACAGTAGCACAGGAGGAAAAAATATCTAACTGTGGTAATGTGGATCTTGTGGTTAATAACAACACAGAAGAATGTGAAACAGCACCTGTGCCAAGAAAATGTTCAAAAATGAAGAAATCACACAAAAATATCAAAGAAATTGATAGCACCATTGAAACAAATCCACTGATTGGCTCCGAAATATTCTATGACGATGATTTAATTATAGGATCGACAGAGTCCATCGGTTCCGAGTTAGATGTGACAGATGTTGTAGCAACGAGTCTTACCAAGCAGCAAAGAGACAAAACGAAAGTGTTAAGGAAGAATGGAAACCACAGAAGAAAAGTGTCCAGAAACTTCGATAGAAATAGGAGCAAGGATTTTCTTAAATCACCACATAGAAGAAAAAAAGATTCCAGCTTGTCAGATGATAGTGTGAAAAGTCCAAAGATGCAGATTGTTCTGGGCTCTGGGCTGGCTCTGCCTGTGTCAGAAGGTGAAGAAGTGATAAAAATAGAGAAGAAGCCCGACTTAGATGGTGAAAGTGATATAGAAGTGGATGTAGGCAGTGATTCTGATAAAGATATATATATACCAAAAAATAAAACAGTCAAAGAGGTTGTTCACGAAGAGGTTCCAGTTGCTGTGCCATTGAGAAAATTTGAACCCATGCCCAGAAGAAATCGGAAAATTAACTTAGACGGCGGTGGTGGTTACACGATAATGCACACGGAAGCTGGTGACATGTATGAGATAGGTCAAGAACCTCGGAAAGAGAGACAGCAAAGAAAACAAGCGGTCCAACTTATACCGTTGCATGTTTATAACTCTGAGAAACCGGCGCCGTGTGCCGTGCACATGTTCGTGTCGGTGTTAGTGAGTATGGACGTGCAGGCTCACTGCAGCAGGGCGGAGGTGATGGGTCTGACGGGAGGCAGCTGGGAGCCCGGACCACGAACACTCACGCTGCAGCTGTACAGGACTGTGCGGGCCGCCGCCGCACACACGCACTGCGACATGGACCCGGTGTCCCAGTCGTCGTCGGCGGAGTCCCTCCGGTGTCGTGGTGTGAGTGTGTGTGGTTGGCATCACTCCCACCCCCAGTTCCCGCCCTCTCCGTCCGTGAGAGACCTCGTCAGTCAACGCTCGCTCCAGAGCCTCGCCTGGGGTCTGCCGTGTGTGGCGCTGGTCACCTCCCAGCACTGGCCTCCCGGACGCAGAGCCTCGCAACTCAGATGTTTCCGTGTAGAAGAGGACGACAAGCTTGACACTCCGGAGGTCCCCGCGGGCTACCAGCTCAATGTGAAGTTGGAGCGTGACCTGGACCGGAGCACCCTCGACCAGTACTTGGAGGAGCTCCGTGTCCTGGCACACGACACGCTCGCACACGTGGAGCTGCCCGTGGACGTGACACGGGACGTGTGTCCTCAGGCCGGCATCACTTACATGGAGAAGTGTCTTTCAAGTGTGAGTCACCACATGCGGTCGGCCGGCTACGAAGACGAGGATCCCATAGTCGCTCGGCTGTTACAAGGAATTAGAGATATATTCAGATAG

Protein sequence:

>DPOGS210851-PA
MADDDEIDILGDFSFNSCFAQNNQGIPSCSNREDTVHPQWLLDSPPTNWYDTQNKDKSYRPKDGPSRKLSGTTANYQHTTVHTSWTQKERDLLAQEMARYGRNVNKISKALKTKSELEIQALIEAEHGILLETENIKTPAVKPDNIPTVAQEEKISNCGNVDLVVNNNTEECETAPVPRKCSKMKKSHKNIKEIDSTIETNPLIGSEIFYDDDLIIGSTESIGSELDVTDVVATSLTKQQRDKTKVLRKNGNHRRKVSRNFDRNRSKDFLKSPHRRKKDSSLSDDSVKSPKMQIVLGSGLALPVSEGEEVIKIEKKPDLDGESDIEVDVGSDSDKDIYIPKNKTVKEVVHEEVPVAVPLRKFEPMPRRNRKINLDGGGGYTIMHTEAGDMYEIGQEPRKERQQRKQAVQLIPLHVYNSEKPAPCAVHMFVSVLVSMDVQAHCSRAEVMGLTGGSWEPGPRTLTLQLYRTVRAAAAHTHCDMDPVSQSSSAESLRCRGVSVCGWHHSHPQFPPSPSVRDLVSQRSLQSLAWGLPCVALVTSQHWPPGRRASQLRCFRVEEDDKLDTPEVPAGYQLNVKLERDLDRSTLDQYLEELRVLAHDTLAHVELPVDVTRDVCPQAGITYMEKCLSSVSHHMRSAGYEDEDPIVARLLQGIRDIFR-