Monarch geneset OGS2.0

DPOGS215187
TranscriptDPOGS215187-TA1359 bp
ProteinDPOGS215187-PA452 aa
Genomic positionDPSCF300143 - 310414-322435
RNAseq coverage44x (Rank: top 72%)
Annotation
HeliconiusHMEL0092621e-15289.97% 
BombyxBGIBMGA008666-TA3e-17375.35% 
Drosophilanab-PC5e-8652.37% 
EBI UniRef50UniRef50_D6W8092e-9953.92%Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6W809_TRICA
NCBI RefSeqXP_970832.14e-9953.00%PREDICTED: similar to nab CG33545-PA [Tribolium castaneum]
NCBI nr blastpgi|2700145678e-9953.92%hypothetical protein TcasGA2_TC004602 [Tribolium castaneum]
NCBI nr blastxgi|2700145679e-10454.25%hypothetical protein TcasGA2_TC004602 [Tribolium castaneum]
Group
Gene OntologyGO:00458926e-46negative regulation of transcription, DNA-dependent
GO:00056346e-46nucleus
KEGG pathway 
InterPro domain[186-339] IPR0069896e-46NAB co-repressor, domain
[14-93] IPR0069885.8e-45Nab, N-terminal
Orthology groupMCL13140 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215187-TA
ATGTGTGGCGCCCGCGGCTACAGTACACTGGTAGTTACGTCAGCGCCATCTAACGAGGCCGAGCTTCAGTTGTACCGAGTAATGCAACGAGCCAGCTTACTGGCTTACTACGACACGTTACTTGAAATGGGCGGCGATGATGTGCAGCAGCTCTGTGACGCTGGTGAAGAAGAATTCCTGGAAATAATGGCATTAGTCGGGATGGCATCCAAGCCGTTACACGTACGGAGATTACAAAAAGCTCTTCAAGAGTGGGTGAACAACCCAGCACTTTTCCAGATACCCATCGTCCCTAACCTCTGTCCCACCGAGAACCCTTTCATACAGTGCAACCAAAGACTTCTAAACATCCCACCGACCATACCAAATTTACCACGAACGGTGTCATCCCCTGACAATAATAGACCCATGTACAGCCCCTCTCCTGGAGCGCCAGAAAACAGCTGTGGGTCAAATGCCTCCGCCCCTAATGCCAGTCCAGTCCACAGTCAGGCAGCTTTCACCAAAATAGTTCTACCATCGTCTCCTGCCCCCACCGCTCCCGTTACAACCACTACCAGATCCTTTTCCCCTCAAAGCGCCTGTCCTCCTGCCTCCTCAGGATCTAGTCCCAGCTCAGTCACATCCCCCGTCCAACTAACACCAGTCCTCTTAGACATTCACGTACAAAAACTAGCTGCGTCAGCAGAAAAGTTGTGTAAGCACCTTCCGCAATTCGAACCGAAGCCGCAAAATACTAAGAAGAAAATGTGCAAGGAACTCGAGTTGGTGATGATGATGCCGGAGTCGGATCCTCGGCGAATGGAGGAGATCAGAAAGTACGCGGCCATATACGGGAGGTTCGATTGCAAGAGGAAGCCCGAGAAACCGTTGACACTTCATGAGGTGCTCGAAGTGTTAGTGAACGAAGCGGCGGCTCAGCTGTCACGATGCTCGCCAGCGCTGCTCACGAGACGCGACGAGCTGTTCCCGCTGGCCAGACAGCTCGTCAGAAGAGCCGGTCTACACTACTCTAAACATGGTCTCCCTCCTTCTGCGACAACGGAGGAACGAGAGCGTGATGACGAGGAAACTCCGAGCAAGCGACCGCGGCAGGAGAGAGAGGACAATGGTGTCAGAGCGAGCCCCGAGACAGCGCGATCCAATTACAGTTGGTGGGGAAAGACGGAGAGTGACGACGCTGATTCTCGCTGTTCATACTCCAGCACCAGCACTCCGCCTCCTGAGGGTGAGGAGCGTCCGCAGGTGGTGGCTGCTCGGGGCGACAGCATCATCGCTGTGGCCAACCCCGCGCTGGCACATCCTCCACACCCTCACCCGCCTCACCCTCACCATCCCGCCAGAAACCATTCCAACTGA

Protein sequence:

>DPOGS215187-PA
MCGARGYSTLVVTSAPSNEAELQLYRVMQRASLLAYYDTLLEMGGDDVQQLCDAGEEEFLEIMALVGMASKPLHVRRLQKALQEWVNNPALFQIPIVPNLCPTENPFIQCNQRLLNIPPTIPNLPRTVSSPDNNRPMYSPSPGAPENSCGSNASAPNASPVHSQAAFTKIVLPSSPAPTAPVTTTTRSFSPQSACPPASSGSSPSSVTSPVQLTPVLLDIHVQKLAASAEKLCKHLPQFEPKPQNTKKKMCKELELVMMMPESDPRRMEEIRKYAAIYGRFDCKRKPEKPLTLHEVLEVLVNEAAAQLSRCSPALLTRRDELFPLARQLVRRAGLHYSKHGLPPSATTEERERDDEETPSKRPRQEREDNGVRASPETARSNYSWWGKTESDDADSRCSYSSTSTPPPEGEERPQVVAARGDSIIAVANPALAHPPHPHPPHPHHPARNHSN-