Monarch geneset OGS2.0

DPOGS212467
TranscriptDPOGS212467-TA1389 bp
ProteinDPOGS212467-PA462 aa
Genomic positionDPSCF300222 - 625626-644769
RNAseq coverage59x (Rank: top 68%)
Annotation
HeliconiusHMEL0093430.074.90% 
BombyxBGIBMGA009643-TA1e-9858.74% 
Drosophilaen-PA4e-5771.81% 
EBI UniRef50UniRef50_B8ZX060.071.63%Invected n=3 Tax=Papilionoidea RepID=B8ZX06_PAPDA
NCBI RefSeqNP_001037454.12e-14461.90%homeobox protein invected [Bombyx mori]
NCBI nr blastpgi|2196860810.071.63%invected [Papilio dardanus]
NCBI nr blastxgi|2196860810.071.63%invected [Papilio dardanus]
Group
Gene OntologyGO:00063551.2e-23regulation of transcription, DNA-dependent
GO:00435651.2e-23sequence-specific DNA binding
GO:00037001.2e-23sequence-specific DNA binding transcription factor activity
GO:00036771.3e-22DNA binding
GO:00055158.1e-21protein binding
GO:00056343e-16nucleus
GO:00072753e-16multicellular organismal development
KEGG pathway 
InterPro domain[362-424] IPR0013561.2e-23Homeobox
[340-420] IPR0122871.3e-22Homeodomain-related
[361-435] IPR0090578.1e-21Homeodomain-like
[420-450] IPR0195494.5e-18Homeobox engrailed, C-terminal
[361-378] IPR0007473e-16Homeobox engrailed
[391-400] IPR0000472.4e-06Helix-turn-helix motif, lambda-like repressor
[384-395] IPR0204799.5e-06Homeobox, eukaryotic
Orthology groupMCL20509 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212467-TA
ATGGCGGCCGTCACCGCACATCTGGATCAGATCAAGATCCAGGATCCTAGCGACGAGGATCCAGAGCCGTACTCCCCAAACACCAGAGACACGACCAGCCCCGACTACGAGGAGAAAGACAGACCAGTACATTCATCCTCGTTCTCCATCCACAACGTCCTAAAGAAAGAAAGAGATAGCCCGGAGAATGTGTTCTCCACTGACAAACTGCTGCAGAATACGCCTAATTTCGAGGAAGGTTCGAGAAATTCCAGTATTAGTCCAAGGTTGGACGATGATCACGAGAGAGCTGATATCAGTGTTGATGACTCCTGCTGCAGTGACGACACCGTGTTGTCGGTTGGCAACGAAGCGCCTGTGTTCGATAAGGCACCAGAGGCCCAAGGAATTACCACCTTCAAACACATTCAGACCCACCTAAATGCTATATCCCAGTTAAGCCACAACCTAACTATGAACCAGCCCCTCCTTCTGCGACCGAATCCGATAGCACCAAACCCATTGATGTTCCTCAACCAGCCGATGATGTTCCAAAACCCTCTGATGAATCACGAGCTCAAAGCCAATGTACCTCGAATGCCGATAGCCCAGAACAGCCTGAACATGAGCCAGTTCAATATAAACTTCGGGAGCAAGTCCCACAAGAGCGACGAGAACCGGCACCAAAATCAGAACTACTCTCCAAAATCTCCAGACAATGAATCGGAGAGGGACTTCATAAACCAGAGCTGTCTGAAATTCAGTATAGACAATATCCTGAAAGCTGACTTCGGCAGGCGGATCACGGATCCTCTGACCAAGCGAAAAACTTCGAAAGCGAGGCAGTACGAGAAGACCAGCCCCGTGAAGGAGGTGACTCCAGTGAAAGAAGTGGAGGCGAGGGTCCCGGAAGTAAAGCCAGCTGATAAGGGCGCGATAGACCTCTCAAAAGCCGATGACAGTGGGAGCAACGCTTCCTCGACTCCTGGTACGACTGGTGAAGGTCCCATGGTGTGGCCGGCCTGGGTGTACTGCACCAGATATAGCGACAGACCGAGTTCCGGTCCCAGGAGTAGACGGGTGAAGAAGAAGGCGAGCCCTGAGGAGAAGAGACCGAGGACTGCCTTCAGCGCCTCGCAGCTAACAAGATTAAAGCACGAGTTCGCGGAGAACCGCTACCTGACGGAGAGGAGGAGGCAGGCGCTGGCCGCGGAGCTGGGGCTGGCGGAGGCTCAGATCAAGATCTGGTTCCAGAACAAGAGGGCCAAGATCAAGAAGGCCTCGGGCCAGAGGAACCCGCTGGCGCTGCAGCTCATGGCGCAGGGGCTGTACAACCACAGCACCATACCGTTGACGAAGGAGGAGGAGGAGTTAGAGATGAAGGCCAGGGAGAGGGAGCAGAGGAATTGA

Protein sequence:

>DPOGS212467-PA
MAAVTAHLDQIKIQDPSDEDPEPYSPNTRDTTSPDYEEKDRPVHSSSFSIHNVLKKERDSPENVFSTDKLLQNTPNFEEGSRNSSISPRLDDDHERADISVDDSCCSDDTVLSVGNEAPVFDKAPEAQGITTFKHIQTHLNAISQLSHNLTMNQPLLLRPNPIAPNPLMFLNQPMMFQNPLMNHELKANVPRMPIAQNSLNMSQFNINFGSKSHKSDENRHQNQNYSPKSPDNESERDFINQSCLKFSIDNILKADFGRRITDPLTKRKTSKARQYEKTSPVKEVTPVKEVEARVPEVKPADKGAIDLSKADDSGSNASSTPGTTGEGPMVWPAWVYCTRYSDRPSSGPRSRRVKKKASPEEKRPRTAFSASQLTRLKHEFAENRYLTERRRQALAAELGLAEAQIKIWFQNKRAKIKKASGQRNPLALQLMAQGLYNHSTIPLTKEEEELEMKAREREQRN-