Monarch geneset OGS2.0

DPOGS200804
Transcript: DPOGS200804-TA (1266 bp)
Protein: DPOGS200804-PA (421 aa)
Genomic position: DPSCF300249 - 51951-58957
RNAseq coverage: 4x (Rank: top 88%)
Annotation
Heliconius: HMEL010728, 3e-16, 86.36%
Bombyx: BGIBMGA000981-TA, 2e-15, 36.36%
Drosophila: (no hit listed)
EBI UniRef50: UniRef50_UPI0002061AC7, 1e-14, 43.90%, UPI0002061AC7 related cluster n=2 Tax=unknown RepID=UPI0002061AC7
NCBI RefSeq: XP_001190734.1, 6e-14, 43.59%, PREDICTED: similar to ENSANGP00000028549 [Strongylocentrotus purpuratus]
NCBI nr blastp: gi|328711478, 5e-14, 43.90%, PREDICTED: hypothetical protein LOC100573948 [Acyrthosiphon pisum]
NCBI nr blastx: gi|328711478, 4e-13, 44.87%, PREDICTED: hypothetical protein LOC100573948 [Acyrthosiphon pisum]
Group
Gene Ontology: GO:0005515, 1.1e-06, protein binding
KEGG pathway: (none listed)
InterPro domain: [340-402] IPR009057, 1.1e-06, Homeodomain-like
Orthology group: MCL34374, Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200804-TA
ATGATATTTTACCCGGAAATTGTAAGACCATTCCCAAAGGCTGGTCCTAGAAAAGTAGGTACAACTAACAGAAGAAAGAGGAAAGCTGCAATACTAACTGACACACCCGAGAAGAAGGCCCTAGAAGAACAACAAAATAAAACCACAAAAAAGGTTAAGAAGAACATGGATAAGAAAAAGAAACAGGACTATGCTGAAAGTTTACCAAGAGAAACTTGGATCCAGTGTCAAGCTTGCAGGGAATGGGCTCATACAAAATGTGTTCCAAGTGCTGGTCTTACTTTTAATTTTGACGAGTCAGAGATATTCAAGGTGTTCAGTATAAGTACACCCAACGTACTAGCTGAAAGCAAACTAAAAACAAATTGGTCAACTTGTTTCGGCAGAGAGAGGGTTCACTTCAAGGATCACTTCATGGAAGGTACTCCAGAAGGTAGTCTAGGGACGGCTACTAAGAGTGGTTGGATAAATTATGGTATATTTGTGGAGATCCTGAAACTCATTCAAAAAAGAACATGCTGCTCCAGAGACCACTCTATTCTCCTCCTAGTCGACAATCATGGAAGCCACGTTACTATAGAAGCGGTGGACTATGTGCGTGATAACAGTGTAGCTGATGTAACGAATGTTGGATCTTACACATTCGCGAATGCGAATGTGAAGAATTTGTGTGATAAAAAGTACTCGCGCAAAGGCCTGACCACTAGTGCTATGAGAAATCATCTGGAAGCTAAACACAAAATTGAATATGAAGAATTGAAAAAAAGAGAATATGAAAAGCAAAGTGCTGCTAAAATTAATTCTTTGGCATCCTCTTCGCGACAATGTACCAAAAGTGACCTTAAGCAAATGTTTTTGAGTGATTGTGTTGAAAAAAATAAAAAGTGGGATAACAACAACACCAAGTCTTTGGTGGTGGATAATTTGATTGGCGAAATGATCGCACTGGAGGACTTACCATTTAGTTTTGTGGAATGTCTTGGATTTAATGGCAATATGCCCCGAAATTATAAGAGGACAAGTAATCGGCAATCATTGAGTCAAGAAGCGATGAAAAAAGCTATTGAAGCTGTTAAAGAGAAAAAAATGGTTTGGCTTCTGGCTTCTAAGACCTTCAAAGTACCACAAGCTACACTAAGGCGGCATGCTTTGGAACAAAATAAAACACTTGTGTGGGACCTACATTACCTACCACACGCAGCGTCCATACAAAACATCAAGCATATGACCGTCGCGAGCCGAAGGCGCGCGGTTACTAAAACGTAG

Protein sequence:

>DPOGS200804-PA
MIFYPEIVRPFPKAGPRKVGTTNRRKRKAAILTDTPEKKALEEQQNKTTKKVKKNMDKKKKQDYAESLPRETWIQCQACREWAHTKCVPSAGLTFNFDESEIFKVFSISTPNVLAESKLKTNWSTCFGRERVHFKDHFMEGTPEGSLGTATKSGWINYGIFVEILKLIQKRTCCSRDHSILLLVDNHGSHVTIEAVDYVRDNSVADVTNVGSYTFANANVKNLCDKKYSRKGLTTSAMRNHLEAKHKIEYEELKKREYEKQSAAKINSLASSSRQCTKSDLKQMFLSDCVEKNKKWDNNNTKSLVVDNLIGEMIALEDLPFSFVECLGFNGNMPRNYKRTSNRQSLSQEAMKKAIEAVKEKKMVWLLASKTFKVPQATLRRHALEQNKTLVWDLHYLPHAASIQNIKHMTVASRRRAVTKT-
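
The two sequences above can be cross-checked against each other. The following is a minimal Python sketch, not part of the original record, that translates short fragments of DPOGS200804-TA with the standard genetic code and confirms that the reported lengths (1266 bp, 421 aa) agree once the stop codon is counted. The helper names `translate` and `lengths_consistent` are hypothetical, and the sketch assumes that the trailing "-" in the protein sequence marks the stop codon.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order
# at each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon (hypothetical helper).

    stop_symbol defaults to "-" on the assumption that the record
    uses "-" to mark the stop codon.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))
    return protein.replace("*", stop_symbol)


def lengths_consistent(transcript_bp, protein_aa):
    """True if the transcript covers the protein plus one stop codon."""
    return transcript_bp == 3 * (protein_aa + 1)


# First 21 and last 12 nucleotides of DPOGS200804-TA, copied from the record.
first_codons = "ATGATATTTTACCCGGAAATT"
last_codons = "ACTAAAACGTAG"

print(translate(first_codons))        # MIFYPEI
print(translate(last_codons))         # TKT-
print(lengths_consistent(1266, 421))  # True
```

The translated fragments match the start (MIFYPEI) and end (TKT-) of DPOGS200804-PA, and 3 x (421 + 1) = 1266 matches the transcript length.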