Monarch geneset OGS2.0

DPOGS201612
Transcript        DPOGS201612-TA  1188 bp
Protein           DPOGS201612-PA  395 aa
Genomic position  DPSCF300152 + 401181-405666
RNAseq coverage   231x (Rank: top 44%)
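
The lengths and coordinates in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and rests on two assumptions that the record does not state: that the transcript is a coding sequence including its stop codon, and that the genomic coordinates are 1-based and inclusive. The variable names are hypothetical.

```python
# Values copied from the record above.
transcript_bp = 1188
protein_aa = 395
locus_start, locus_end = 401181, 405666

# Assumption: the transcript is the coding sequence including its stop codon.
codons, remainder = divmod(transcript_bp, 3)
residues = codons - 1  # drop the terminal stop codon

# Assumption: coordinates are 1-based and inclusive.
locus_span = locus_end - locus_start + 1

print(remainder, residues == protein_aa, locus_span)  # prints: 0 True 4486
```

Under these assumptions the transcript length agrees with the protein length, and the genomic span (4486 bp) is longer than the transcript (1188 bp), which would be consistent with the gene containing introns.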
Annotation
Heliconius        HMEL008107        0.0     91.98%
Bombyx            BGIBMGA012202-TA  2e-170  80.89%
Drosophila        Chi-PB            1e-137  73.80%
EBI UniRef50      UniRef50_O43679   3e-135  72.70%  LIM domain-binding protein 2 n=62 Tax=Eumetazoa RepID=LDB2_HUMAN
NCBI RefSeq       XP_001599552.1    1e-156  80.60%  PREDICTED: similar to lim domain binding protein [Nasonia vitripennis]
NCBI nr blastp    gi|383856253      7e-160  85.00%  PREDICTED: uncharacterized protein LOC100880826 [Megachile rotundata]
NCBI nr blastx    gi|383856253      6e-159  81.66%  PREDICTED: uncharacterized protein LOC100880826 [Megachile rotundata]
Group
Gene Ontology     GO:0005634  1.9e-249  nucleus
                  GO:0007275  1.9e-249  multicellular organismal development
                  GO:0003712  1.9e-249  transcription cofactor activity
KEGG pathway
InterPro domain   [74-374]  IPR002691  1.9e-249  LIM binding protein
Orthology group   MCL10313  Multiple-copy universal gene

Nucleotide sequence:

>DPOGS201612-TA
ATGGCGCGCGAGCGCACTGAGCGGCAACACCGCACGAGGCCGACGCCGATCCGCGCTAGATGGCGCGGTAACAAAGTGGCCTACATGCCCGTGGCGCTGGGCTCGCTCGGCGAGGGCGAGTACCGGGAGTACCACCACCACTCGCCTCTTCATCATCAGAACCAGCACCAGCACCAGCCCCAGTACCATGAGTACGGAGCTATATACCATCCCATACCGGAGCCCTACTTTAGGAGGCATGCTCCGTACTTCGGACAGCCAGACTACAGAGTATATGAACTCAACAAGAGATTACAACAGAGGACCGAGGACTCCGACAACTTGTGGTGGGATGCCTTCGCCACTGAGTTCTTCGAGGACGACGCGACGCTCACACTGACATTCTGTTTAGAAGATGGACCCAAGAGATACACAATAGGAAGAACCTTAATACCTCGCTACTTCCGGAGTATATACGAAGGTGGTGTCTCCGAGCTGTACTACACCATGAGGCAGCCCAAGGAGTCCTTCCACAACACCAGCATCACGCTGGACTGCGACCACTGCACCATGGTCACCCACCACGGCAAGCCCATGTTCACCAAGGTTTGTACGGAGGGCCGCCTCATCCTGGAGTTCACCTTCGACGACCTGATGCGCATCAAGTCGTGGCACATGGCGGTGAGGGCGCACCGCGAGCTGATACCCCGGCAGGCGGTGCACCCCCCCGACCACGCCGCCCTGGACCAGCTGGCCAAGAACATCACCCGGCAAGGCATCACCAACTCCACACTCAACTATCTCAGGCTGTGCGTGATCCTGGAGCCGATGCAGGAGCTGATGTCTCGTCACAAGGCGTACGCGCTCTCCCCCAGAGACTGCCTCAAGACCACGCTGTTCCAGAAGTGGCAGAGGATGGTCGCCCCGCCCGAGTCCCAGAGGCCGGCCAGCAAGAGGCGCAAACGTAAAGGCAGCGCCGGAGCCAACGCCGCTCCCCCCGCGCCCGCCAAGAAGCGGTCCCCCGGACCCAACTTCAGCCTCGCCTCACAGGACGTGATGGTGGTCGGCGAGCCGTCGCTGATGGGCGGGGAATTCGGCGACGAGGACGAGCGGCTCATCACCAGGCTGGAGAACACGCAGTACGAGGGCGACGACTGGCCCGCCCCGCCGCCCGCCTCGCCCGCCAAGACCCCGCCGGCCAACCACTGA

Protein sequence:

>DPOGS201612-PA
MARERTERQHRTRPTPIRARWRGNKVAYMPVALGSLGEGEYREYHHHSPLHHQNQHQHQPQYHEYGAIYHPIPEPYFRRHAPYFGQPDYRVYELNKRLQQRTEDSDNLWWDAFATEFFEDDATLTLTFCLEDGPKRYTIGRTLIPRYFRSIYEGGVSELYYTMRQPKESFHNTSITLDCDHCTMVTHHGKPMFTKVCTEGRLILEFTFDDLMRIKSWHMAVRAHRELIPRQAVHPPDHAALDQLAKNITRQGITNSTLNYLRLCVILEPMQELMSRHKAYALSPRDCLKTTLFQKWQRMVAPPESQRPASKRRKRKGSAGANAAPPAPAKKRSPGPNFSLASQDVMVVGEPSLMGGEFGDEDERLITRLENTQYEGDDWPAPPPASPAKTPPANH-
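
The relationship between the nucleotide and protein sequences above can be checked by translating the transcript codon by codon. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) and assuming the trailing "-" in the protein record marks the stop codon; neither is stated in the record. The names `CODON_TABLE` and `translate` are hypothetical helpers, not part of any database tooling.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str, stop_symbol: str = "-") -> str:
    """Translate an in-frame coding sequence, writing stop codons as stop_symbol."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First seven and last five codons of DPOGS201612-TA, copied from the record.
print(translate("ATGGCGCGCGAGCGCACTGAG"))  # prints: MARERTE
print(translate("CCGGCCAACCACTGA"))        # prints: PANH-
```

Both fragments match the start (MARERTE) and end (PANH-) of DPOGS201612-PA. Passing the full 1188 bp sequence to `translate` would allow the same comparison over the whole protein.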