Monarch geneset OGS2.0

DPOGS203173
TranscriptDPOGS203173-TA1311 bp
ProteinDPOGS203173-PA436 aa
Genomic positionDPSCF300035 - 598512-606759
RNAseq coverage111x (Rank: top 59%)
Annotation
HeliconiusHMEL0057923e-2986.57% 
BombyxBGIBMGA011022-TA0.086.85% 
DrosophilaSu(var)3-3-PA5e-11750.61% 
EBI UniRef50UniRef50_O603417e-13359.01%Lysine-specific histone demethylase 1A n=98 Tax=Metazoa RepID=KDM1A_HUMAN
NCBI RefSeqXP_002410496.16e-14060.54%lysine-specific histone demethylase, putative [Ixodes scapularis]
NCBI nr blastpgi|2416536111e-13860.54%lysine-specific histone demethylase, putative [Ixodes scapularis]
NCBI nr blastxgi|2416536113e-13460.34%lysine-specific histone demethylase, putative [Ixodes scapularis]
Group
Gene OntologyGO:00551142e-44oxidation-reduction process
GO:00164912e-44oxidoreductase activity
GO:00055151.7e-19protein binding
KEGG pathway 
InterPro domain[120-402] IPR0029372e-44Amine oxidase
[5-105] IPR0119913.3e-20Winged helix-turn-helix transcription repressor DNA-binding
[1-107] IPR0090571.7e-19Homeodomain-like
[7-92] IPR0075264.7e-15SWIRM
Orthology groupMCL12839 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203173-TA
ATGTTAAATAGCTTGGAAGGTGCTGCATTTCAATCTAGACTACCATTTGATAAAATGTCTTCATTAGAGGCCGAATGTTTTTCGGATTTGCAAGGAGCATCAAGCCTTGCACTTGTTCAAATAAGAAATCGAATATTGAAACTTTGGTTTGAAGATCCTAAGAAACAGCTCACTCAAGAACAGTCCGTTCAAAAAATGGAACCACCATACAGTGCAGATCCTGCCTTAGTGATGAGAACTCATGCTTTTCTGGAACGACATGGTTTTATAAATTATGGTATTTATGAACGAGTAAATCCTATTCCATCACCACCAAAGGGTAAACAAAGACCCAAGGTCATAATTATTGGTGCAGGAGTATCAGGCTTGGCAGCTGCACGTCAGTTGCAATCCTTTGGATGTGAAGTAGTAATATTGGAAGGGAGAGACAGAGTTGGAGGCAGGGTTGTCACTTACCGGAAGGGACCTTATGTTGCTGACTTAGGAGCAATGGTGGTAACTGGATTGGGTGGTAATCCTGTTACAACGTTGTCAGTACAAATGAACATGGAGTTGCATAAGATAAAACAAAAATGTCCATTGTATGAAGCGACCGGTAATCAAGTTGAGCGTGAATTCAATCGCTTGCTGGATGCTACCTCATACCTATCACATCAAGTAGACTTTAATTATTACAACGATACTCCCGTGTCTTTGGGACAGGCTTTGGAATGGGTCATTAAATTGCAGCAAAAGAGTGTTAAACATAAACAGATTCAACACCTTAAAGCATTGATAACCTTGCAAGAGAAGCTAAAGAATAATATCAACAAGATGTCAGATATAAACGAACTCCTTAAGTCTATGAAATCTGAGAAGGAGACGTTGTTGGCTGATCGCGAGAAGGTAGCGTCTGGAGACGCTAGTATCTTGCAGGAGTTTAATCTTAGAAGATTAAACCGGGAGATGAATCTGCTGTGTAACGAGTATGAGACGCTCACCACACAGAATAATACTATAGAGGATAAGTTGGTGCAGTTGGAATCTAATCCGCCGAGTTCGGTGTACCTCTCGGTTCGAGATCGTCAGATTCTTGACTGGCATTTCGCTAACCTGGAGTTCGCGAACGCAACGCCTTTAGGAAACCTGTCGCTGAAGCACTGGGACCAGGACGACGACTTCGAGTTCACTGGGAACCATCTAACAGTTCTTCAAAATTTACGACATCGTTGTCATTTGAAGCGAGCTTTTGCATCTCGGCAATCTGTGAAGGGTATGGAACACGAGGACATCAATCTGGTTGCTGCGAAGGCGTGGGGCGTGGATGAATAA

Protein sequence:

>DPOGS203173-PA
MLNSLEGAAFQSRLPFDKMSSLEAECFSDLQGASSLALVQIRNRILKLWFEDPKKQLTQEQSVQKMEPPYSADPALVMRTHAFLERHGFINYGIYERVNPIPSPPKGKQRPKVIIIGAGVSGLAAARQLQSFGCEVVILEGRDRVGGRVVTYRKGPYVADLGAMVVTGLGGNPVTTLSVQMNMELHKIKQKCPLYEATGNQVEREFNRLLDATSYLSHQVDFNYYNDTPVSLGQALEWVIKLQQKSVKHKQIQHLKALITLQEKLKNNINKMSDINELLKSMKSEKETLLADREKVASGDASILQEFNLRRLNREMNLLCNEYETLTTQNNTIEDKLVQLESNPPSSVYLSVRDRQILDWHFANLEFANATPLGNLSLKHWDQDDDFEFTGNHLTVLQNLRHRCHLKRAFASRQSVKGMEHEDINLVAAKAWGVDE-