Monarch geneset OGS2.0

DPOGS207050
TranscriptDPOGS207050-TA3486 bp
ProteinDPOGS207050-PA1161 aa
Genomic positionDPSCF300001 + 2074283-2083978
RNAseq coverage244x (Rank: top 42%)
Annotation
HeliconiusHMEL0104950.084.65% 
BombyxBGIBMGA012992-TA0.088.02% 
DrosophilaBr140-PA0.054.21% 
EBI UniRef50UniRef50_D6WYQ10.061.24%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WYQ1_TRICA
NCBI RefSeqXP_967270.10.061.24%PREDICTED: similar to AGAP007617-PA [Tribolium castaneum]
NCBI nr blastpgi|910878270.061.24%PREDICTED: similar to AGAP007617-PA [Tribolium castaneum]
NCBI nr blastxgi|910878270.049.32%PREDICTED: similar to AGAP007617-PA [Tribolium castaneum]
Group
Gene OntologyGO:00055151.2e-35protein binding
GO:00082701e-07zinc ion binding
KEGG pathway 
InterPro domain[567-699] IPR0014871.2e-35Bromodomain
[129-226] IPR0195426e-23Enhancer of polycomb-like, N-terminal
[1041-1121] IPR0003136.7e-23PWWP
[230-311] IPR0110111.7e-19Zinc finger, FYVE/PHD-type
[242-294] IPR0130833.7e-11Zinc finger, RING/FYVE/PHD-type
[247-293] IPR0019651e-07Zinc finger, PHD-type
Orthology groupMCL10756 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207050-TA
ATGGGTTTGGATTTTGATGTCTTTGAATTCTGCAAAAAATTGCGTCAGAACAGGCCTCCTCCGTATCAGTGTCCACTAGAGAAATGTGATAAAGTATACAAGAGTTTATGTGGTTTGCAATATCACTTAGTAAACTATGACCATGACAATCCAACACCGGCGACACCTTCAATTGCAAGCAGTCGCAAGAAAGGCAGAACTCGAGCTGCTGTGCCTACTGGAGATATCGCACTTCAAAGCCCACCTAAGGAAGCTTTGACTTTTGCTGAGGCACAAAAAGTGGTGCAGTTTGAAGTTGATGGAAAAATTAGTAGAATACCAATTGACCAGCCACTGCCTATAGTTTCATTGGAAGAGTGGGAGAGAAAAAATGCAGATTTAGAAAAGCCTATGCCATTTGTAGAGCCACCATCAGAGCCTCATGTTAAATTACCAGAGGCTACATTCCGTCTAATACCAGATTATAATGCACGGGTATGTGACGCACCACCTCGGCCAAATGCATACATACGTTTCATAGAGAAGTCGGCTGAGGAATTGGATGGTGAAGTGGAATATGATGTTGATGAAGAAGACACAGCCTGGCTGGCCATTATTAATAAGAACAGAACCAAACAAGGTCTACCACCCGTCTCCGTAGATACCCTGGAGCTACTCATGGATAGATTAGAAAAGGAATCATATTTTCAGGCTACACAGAACGGCCAGCAACCTGCGGCAACGGTGGACGAAGATGCTGTGTGCTGCATATGCATGGACGGGGAATGCCAGAACACCAATGTGATCCTGTTTTGTGATATGTGTAACCTGGCCGTCCACCAGGATTGTTATGGGGTGCCATATATACCAGAGGGACAATGGCTGTGCAGACGTTGTCTTCAATCACCATCACGACTTGTTAACTGTGTGCTATGTCCCAACACTGGAGGAGCATTCAAGCAGACAGATCAGGGCACTTGGGCGCACGTCGTCTGCGCCCTCTGGATACCAGAAGTTCGCTTTGCAAATACAAATACATTTTATTTGTTAACAGATTCAATAGAGATGATTCCGGCTGCTCGTTGGAAGCTTCAGTGTATGGTGTGCAAGCAGCGAGGTGCTGGTGCTTGCATACAATGTCACCGTAGCAACTGTTACAGCGCCTTCCATGTCACATGCGCCCAGCAGGCCGGTTTGTATATGAAGATGGAAGCGGCCGGATCTGGCCGTGATCCCAGTCAACCAGTTCAGGTGGCCAAAATGGCGTACTGTGATGCACACACACCAGCACATGTATTACAGGAGAGGAGAGCTTTGGAGTCGGAAGGTGAAAGTAAATCTTCAGATTTGACTTCCATACGACAGAAAGGAAGGGAGAAGATAAAACAGGCTCGGAGAGTGTTAGCGTTGAAGCGTACGTGGGCGCCGGTAGTGTTGGTGCCGACATTACCACCTGAACGTGTTGCTGAGATTGCCCAACTGTCACACGGGACACCCGCGGCTAGAGCACAGCTGATGAAAAGACTTCTCGCTTACTGGACCCTCAAAAGACACAGCAGGAACGGGGTTCCACTCCTTAGGAGGCTACAAAGCCTGACCAGCCATCACGGGAGCCGAGGTATCCAAGATGGCACTGTGAATGTACGAGAACTCTGCAATCAACTCAAGTACTGGCAGCGGATAAGACAAGATCTGGAGAGGGCTAGATTGCTGTGTGAGTTGGTACGTAAACGCGAGCGTCTCAAGGCGGAATACACTCGTGTTTGGGAACGCTGTGTTTTGCATACGCTCCGACCTGAACGTGCCATGCTGAGCAAGATGCTGCGCATGATGAGACACGCTGACCACAGTGACGTGTTCACGGAGCCGGTCGACCCGCTAGAGGTTCCAGATTACAGCACCGTCGTAAAGCATCCCATGGATTTAAGTACCATGGGCAAGAAATTGGACAGAGGCATTTATAAGACCATAGATGACGTAGAGGCAGATTTCCAACTAATGATAGACAACTGCCTCACATATAATAAAAAGGATACAGTGTTTTACAAAGCTGGTGTCAAGATGAGGGAGCAGTGTACGTCTATATTTCGTCAAGCACGTCGTGACGTCATAGAGGCGGGTCTGGCGTCGCTGGCAGGGGAAGGGGACGCAGAGGAAACTTACACACCCGGGCGCACACACGCACAGAAACACACACAGTCAAGGCGCAGGAGTGTAAGAAACACAAGCAGCGACAGCGATCGTACAGCCGATACTCGAAGCGAGCGCGGCGTCTCCCTGGCACGTAGCGAGCGACGACACACCAGCGCATTGAGAGACAGTGACGACGACATTAATCAGCGCGAGCCGTCGCCGGCTAAGAGCAAGGTGAACCGCAATTGGTGGCGCGGCCGCGGTCGTGGTAGGGGCAGGAGGGGGAGGAGGGGCCGCGGGAGAGGGGGACACGTGTCGCCTCGACCACTCAGAGACAACGATACTCCGACGACGGATTCAGAAGCTCCTATAGTTAAATCGAAGACTGTTGAGCGAACACAAAAATTGGTCACACCAGAAAAGTCACCAACTAAGCAACTGGAAAGTACTGGTCTAGGACTACTGGGTGGTTTGAGAAAACCTACTTTGCTTGTGACGCCCTCAATAACCACCCCTCCGAAGAGTTTTGGTTCTGATGCCACTTTGCCAACACTATCAGCCAGCTTGGGACACACAGAACCCTCGCCCAGGAAGAAGGGTCGCGGTCGTCCACGGAAACAAGATAAAACAACAGATCTATTCAGAGGGGACTCGGAGGTCCTAGGAGGAGCATCGTTCCTTCAATACCGCGGTCCTCCCGGGGAAGTCGGCTCGGATAGCGATTTGGCACTATCAAGGTCGTCAAGCAGTAGTTCAGCGTGGTCCCAGTCATGTTCCTCGTGCACACACTATGACGACGATAGATCTGGCGATGACAGTTCCAGCGAAGGTTCTAGTTACAATGAAACGTTGGATTCGTTAGAAAGCCGCGGTCCGGAGCCCAGACGTCGCGGCCGTCGGGCCGATGAGGGAGTCGACCGCACTCAGCCAGCAACACCCGTCAAGGGCCGTGGTACAAGATCTTCTACGTCCAAGACTCCGGTGAAAGTTACCCAATCAGATGTTCTGCTTGAGCCGTTGCAGTTAGTATGGGCAAAGTGCAGAGGCTATCCTTGGTATCCCGCACTAATAATAGATCCGAAGATGCCAAAAGGTTACATATACAACGGAGTTCCTCTACCAGTGCCGCCTCAAGATGTACTGAACCTCAAGAAGAATTATGCTCACGAACCAGTATTGTACCTAGTTCTATTTTTCGACGTTAAACGAACGTGGCAATGGCTGCCTCCAAATAAATTGGAAATCTTGGGCCTAGATAAGGAGATAGATGAAGCCAAACTGGTGGAGTCACGGAAACCGACCGACAGGAAGGCTGTCAAGAAGGCTTATGGTGATGCAATGCAGTTCCGGAAGCAGGTTGACGGTGATAAATGA

Protein sequence:

>DPOGS207050-PA
MGLDFDVFEFCKKLRQNRPPPYQCPLEKCDKVYKSLCGLQYHLVNYDHDNPTPATPSIASSRKKGRTRAAVPTGDIALQSPPKEALTFAEAQKVVQFEVDGKISRIPIDQPLPIVSLEEWERKNADLEKPMPFVEPPSEPHVKLPEATFRLIPDYNARVCDAPPRPNAYIRFIEKSAEELDGEVEYDVDEEDTAWLAIINKNRTKQGLPPVSVDTLELLMDRLEKESYFQATQNGQQPAATVDEDAVCCICMDGECQNTNVILFCDMCNLAVHQDCYGVPYIPEGQWLCRRCLQSPSRLVNCVLCPNTGGAFKQTDQGTWAHVVCALWIPEVRFANTNTFYLLTDSIEMIPAARWKLQCMVCKQRGAGACIQCHRSNCYSAFHVTCAQQAGLYMKMEAAGSGRDPSQPVQVAKMAYCDAHTPAHVLQERRALESEGESKSSDLTSIRQKGREKIKQARRVLALKRTWAPVVLVPTLPPERVAEIAQLSHGTPAARAQLMKRLLAYWTLKRHSRNGVPLLRRLQSLTSHHGSRGIQDGTVNVRELCNQLKYWQRIRQDLERARLLCELVRKRERLKAEYTRVWERCVLHTLRPERAMLSKMLRMMRHADHSDVFTEPVDPLEVPDYSTVVKHPMDLSTMGKKLDRGIYKTIDDVEADFQLMIDNCLTYNKKDTVFYKAGVKMREQCTSIFRQARRDVIEAGLASLAGEGDAEETYTPGRTHAQKHTQSRRRSVRNTSSDSDRTADTRSERGVSLARSERRHTSALRDSDDDINQREPSPAKSKVNRNWWRGRGRGRGRRGRRGRGRGGHVSPRPLRDNDTPTTDSEAPIVKSKTVERTQKLVTPEKSPTKQLESTGLGLLGGLRKPTLLVTPSITTPPKSFGSDATLPTLSASLGHTEPSPRKKGRGRPRKQDKTTDLFRGDSEVLGGASFLQYRGPPGEVGSDSDLALSRSSSSSSAWSQSCSSCTHYDDDRSGDDSSSEGSSYNETLDSLESRGPEPRRRGRRADEGVDRTQPATPVKGRGTRSSTSKTPVKVTQSDVLLEPLQLVWAKCRGYPWYPALIIDPKMPKGYIYNGVPLPVPPQDVLNLKKNYAHEPVLYLVLFFDVKRTWQWLPPNKLEILGLDKEIDEAKLVESRKPTDRKAVKKAYGDAMQFRKQVDGDK-