Monarch geneset OGS2.0

DPOGS207923
Transcript: DPOGS207923-TA, 1179 bp
Protein: DPOGS207923-PA, 392 aa
Genomic position: DPSCF300349 + 159688-177354
RNAseq coverage: 645x (Rank: top 20%)
Annotation
Heliconius: HMEL016599, 3e-13, 26.52%
Bombyx: BGIBMGA000073-TA, 4e-84, 60.00%
Drosophila: CG8359-PA, 9e-08, 23.77%
EBI UniRef50: UniRef50_UPI00017E10DA, 5e-19, 26.84%, UPI00017E10DA related cluster n=1 Tax=unknown RepID=UPI00017E10DA
NCBI RefSeq: NP_001127789.1, 1e-19, 26.84%, dorsal interacting protein 3 [Nasonia vitripennis]
NCBI nr blastp: gi|197245376, 2e-18, 26.84%, dorsal interacting protein 3 [Nasonia vitripennis]
NCBI nr blastx: gi|380010935, 1e-19, 25.94%, PREDICTED: uncharacterized protein LOC100865216 [Apis florea]
Group
KEGG pathway 
InterPro domain: [14-116] IPR006578, 1.2e-15, MADF domain
Orthology group: MCL34723, Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207923-TA
ATGGTGAGGACCAAGGACGGGGACGGCATCAAGAGCGGTACCATAGTGTCCGAGGTTCAGAAGCGGCCCTGTCTGTACGACACCAACGACATGAACTACGGAGATAGGGCGGAGAAGATGAGGAGCTGGGAGGAGGTCTGCGAGAGCGTGGTCCCAGGCTGGAGCGGCCTGGGGCTGGACGAGAAGCTCTCGGCCGGTAAGGAGGTCCAGAAAAGATGGCGCTCCATCAGGGACTCGTACACCAAGGCGTTTAGACAGGGCAAGTGTCCGCCGCCGGAGCAGTGCGCCCCGGGTAGCAGGAGATACCAGTACCACAGACAGATGTCCTTCCTGCTGAAGGCTCTGCAGAACAAAAAACCACGTTACTCACAAGATAAATACGAATCATTCTCCGAAGTATCGAACTCGCCCCAGCATCCGCCGCCTGAAGACGTGTCGCGGGTCAAGAACGAGATCACAGACAAGCCTCTGGACCTCAAGACCAAACCTGAAACAGCTGACAGACCGGAAACACAGGACAAATACAGCCAGGCGTCCATAGACAAGAAGCTGGAGATAACAACAGACGCGACCCACATACTACCAAATGATCACTTCGACGATGACAAGCTGTTCATGAATTCATTACTGCCGTTATTCAAGAAGATGACCGACGACACGCGTCTGTTGTGTAGAATAGAAGTGTTGAAGATCATCAGGTACGCGCTACAAGGACACCAGGCCTTCGACCCGCCCAAGATGAACGACGATAGCTACAAGCAAGTGAATGTTGAGAAGAGGGATGCCACAGACAGTAAGACCGAGAGCACACTGTCTATGACCACAAGATCAGCTGACGGAAGTCGACCCATCAGGAAACGGAGGGCTCGATCCCCGTCCCCACTTCCTGTACCACCTAAACGTAGAGGACCAGGTCGGCCCAGGAAGATCCGTCCGCCTCCCTCTGATTCCGAGGAGGACGTTCCTCAGAGAAGATTTCCCAAGCTGAAGACATCTCCGGTAGACTCTCACGATGAGGACTACAGTAAAGCAACCAGTGTGGCTCAGCTCTCCACTCCACTCTTCATGAAGATGTACAACTTGGAAAGATCAAAAGTGGCTCCGATAACATCAACACAGTCCATGCTCGTTTCTGTAAAGACAGAACCTCTCGATACGCAGTCAGTTAGCCCCATGTAG

Protein sequence:

>DPOGS207923-PA
MVRTKDGDGIKSGTIVSEVQKRPCLYDTNDMNYGDRAEKMRSWEEVCESVVPGWSGLGLDEKLSAGKEVQKRWRSIRDSYTKAFRQGKCPPPEQCAPGSRRYQYHRQMSFLLKALQNKKPRYSQDKYESFSEVSNSPQHPPPEDVSRVKNEITDKPLDLKTKPETADRPETQDKYSQASIDKKLEITTDATHILPNDHFDDDKLFMNSLLPLFKKMTDDTRLLCRIEVLKIIRYALQGHQAFDPPKMNDDSYKQVNVEKRDATDSKTESTLSMTTRSADGSRPIRKRRARSPSPLPVPPKRRGPGRPRKIRPPPSDSEEDVPQRRFPKLKTSPVDSHDEDYSKATSVAQLSTPLFMKMYNLERSKVAPITSTQSMLVSVKTEPLDTQSVSPM-
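
The listed lengths fit a coding sequence that ends in a stop codon: (392 + 1) x 3 = 1179. The sketch below is one way to check that relation and to translate a coding sequence with the standard genetic code. It is illustrative only and not part of the geneset pipeline. The function names `translate` and `lengths_consistent` are hypothetical, and it assumes that the trailing "-" in the protein record marks the stop codon.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    stop_symbol defaults to "-" on the assumption that this is how the
    record above marks the terminal stop codon.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


def lengths_consistent(transcript_bp, protein_aa):
    """True if the transcript length equals 3 * (protein length + one stop codon)."""
    return transcript_bp == 3 * (protein_aa + 1)
```

For example, `translate("ATGGTGAGGACCAAGGACGGG")` on the first seven codons of DPOGS207923-TA gives `MVRTKDG`, the start of DPOGS207923-PA, and `lengths_consistent(1179, 392)` is true.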