Monarch geneset OGS2.0

DPOGS214467
Transcript: DPOGS214467-TA | 870 bp
Protein: DPOGS214467-PA | 289 aa
Genomic position: DPSCF300122 - 629042-633925
RNAseq coverage: 35x (Rank: top 74%)
Annotation
Heliconius: HMEL003746 | 4e-141 | 90.44%
Bombyx: BGIBMGA013353-TA | 1e-114 | 90.87%
Drosophila: onecut-PA | 2e-42 | 92.77%
EBI UniRef50: UniRef50_A4II00 | 2e-46 | 47.78% | Onecut1 protein n=26 Tax=Euteleostomi RepID=A4II00_XENTR
NCBI RefSeq: XP_002429426.1 | 4e-55 | 54.15% | conserved hypothetical protein [Pediculus humanus corporis]
NCBI nr blastp: gi|242017908 | 7e-54 | 54.15% | conserved hypothetical protein [Pediculus humanus corporis]
NCBI nr blastx: gi|242017908 | 1e-53 | 52.96% | conserved hypothetical protein [Pediculus humanus corporis]
Group
Gene Ontology: GO:0003677 | 1.7e-38 | DNA binding
KEGG pathway: xtr:100101763 | 1e-46
K08026 (ONECUT1, HNF6) maps to: Maturity onset diabetes of the young
InterPro domain: [163-235] IPR010982 | 1.7e-38 | Lambda repressor-like, DNA-binding
InterPro domain: [157-237] IPR003350 | 7.8e-32 | Homeodomain protein CUT
Orthology group: MCL26063 | Lepidoptera specific

Nucleotide sequence:

>DPOGS214467-TA
ATGGACGATACACGGCTCAGGGAGCGAGCGCCTCTCACGGTCATAGTGGCGCCCAGTAACGTGTCTCCACCCCGGCTGTCCCCCGCGGACCTGCTGCCCGACGGAGACGCTGCCTTCCACCCGCTGTCCGCCGTCAACGGCCGCCTCACACCGCCCGGGCTCGAGCCCGCGTCCTACGCTACTCTGACGCCCCTGCTCCCTCTGCCCCCCATCAGCACCGTGTCCGACAAGTTCGCGTACCACGCGGGCGGCACTTTCACGGTCATCCAGCAGCAGCAGTCCTACGCGTCCTTGTCTCCGACCGCCTACAATGAGCCGCTGTCGCCGCAGTCCGCGTACAGTCGGCGGAGCGCGTCACCCGGCTCGTACGAGCGCCGTTCCCCCTCGCCGCCGCTGCCCAGCCCGGGGCTGGACCTGAACGCGGCGCTTCTGGCCAGAGAGACGAGAGACGAGCAGGCGCAGCAGACGCAGCAGAACGACACGGAGGAGATAAACACCAAGGAGCTCGCGCAGAGGATAAGCGGAGAACTGAAGAGGTACTCCATACCTCAGGCGATATTCGCTCAGAGGGTGCTGTGTCGGTCGCAGGGTACGCTCAGCGACCTACTCAGGAACCCCAAGCCGTGGTCCAAGTTGAAGTCGGGCCGAGAAACCTTCAGGCGGATGTGGAAATGGCTACAGGAACCCGAGTTTCAAAGGATGTCGGCCTTGAGACTTGCAGATGCGCCCAATCAAACGATAAAAGGCGACAATGTAAACAACATGCAGCAGCAGTCATATCCCAGGGAGTATCAGCCCGTGGCCCCGGCCGTGTACCCGCCCTGGGAAACTCCGGCTTACGAACCCGCCGGGGACGTAACAGGCTTGTGA

Protein sequence:

>DPOGS214467-PA
MDDTRLRERAPLTVIVAPSNVSPPRLSPADLLPDGDAAFHPLSAVNGRLTPPGLEPASYATLTPLLPLPPISTVSDKFAYHAGGTFTVIQQQQSYASLSPTAYNEPLSPQSAYSRRSASPGSYERRSPSPPLPSPGLDLNAALLARETRDEQAQQTQQNDTEEINTKELAQRISGELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLADAPNQTIKGDNVNNMQQQSYPREYQPVAPAVYPPWETPAYEPAGDVTGL-
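
The transcript and protein records above can be cross-checked against each other. The following is a minimal sketch, not part of the geneset release, and it assumes the standard genetic code and that the trailing "-" in the protein record marks the stop codon. The function name `translate` and the variable names are illustrative only. It translates the first and last codons of DPOGS214467-TA and confirms that the reported lengths agree (870 bp = 3 x (289 aa + 1 stop)).

```python
# Sketch: check that the CDS and protein records are consistent.
# Assumption: standard genetic code; stop codons are rendered as "-"
# to match the convention used in the protein record.

BASES = "TCAG"
# Amino acids in TCAG codon order; "*" marks the three stop codons.
AMINOS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINOS)
)


def translate(cds: str, stop_symbol: str = "-") -> str:
    """Translate a coding sequence in frame 0, codon by codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First 15 and last 18 nucleotides of DPOGS214467-TA, copied from the record.
cds_start = "ATGGACGATACACGG"
cds_end = "GACGTAACAGGCTTGTGA"

start_peptide = translate(cds_start)  # should match the protein's first residues
end_peptide = translate(cds_end)      # should match the protein's last residues

# Length bookkeeping from the record: 870 bp transcript, 289 aa protein.
lengths_agree = 870 == 3 * (289 + 1)
```

To check the whole entry, paste the full nucleotide sequence into `translate` and compare the result with the DPOGS214467-PA sequence.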