Monarch geneset OGS2.0

DPOGS204973
TranscriptDPOGS204973-TA1419 bp
ProteinDPOGS204973-PA472 aa
Genomic positionDPSCF300123 - 243229-252054
RNAseq coverage484x (Rank: top 26%)
Annotation
HeliconiusHMEL0094703e-17168.04% 
BombyxBGIBMGA009345-TA4e-1234.13% 
Drosophilal(3)05822-PB1e-2236.30% 
EBI UniRef50UniRef50_UPI00021A66E18e-4436.98%UPI00021A66E1 related cluster n=2 Tax=unknown RepID=UPI00021A66E1
NCBI RefSeqXP_001607171.13e-3735.73%PREDICTED: similar to CG7129-PA [Nasonia vitripennis]
NCBI nr blastpgi|3838565343e-4734.34%PREDICTED: uncharacterized protein LOC100877961 [Megachile rotundata]
NCBI nr blastxgi|3838565342e-7038.46%PREDICTED: uncharacterized protein LOC100877961 [Megachile rotundata]
Group
Gene OntologyGO:00055151e-19protein binding
KEGG pathway 
InterPro domain[418-473] IPR0014521e-19Src homology-3 domain
[317-334] IPR0001084.9e-11Neutrophil cytosol factor 2 p67phox
Orthology groupMCL19001 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204973-TA
ATGGATTTGTTTAAACAAAATATGAAGGGACCTACGCGGCCAGCTCCCGCCGTACCCAACATGGGTGTCAGTAGATATTCTCCTGTCACTAACTGGAATGACGATCCTTTTGGTGCTGATGTCTTTGAACCGACACCCTCATACGCCCAAAAAAAGAAACCGCCACCAAGGCCGCCGCCGCCTAAAGTGTCTAAGCATCTGGATGTTCCAACAAAACCCTCCCATTTCATACGTAGACCTACAGTGCTGTCTTCAATTTTGGGCCGACACAACAAGAACAAAGTGACCCAACAGAATGCCAGCCAACCTATATTGGATGTGAATAGTGATGTAGACACATACAAAATGTTTCCTAAATGTGATCGTGCAAATAATCAAATGGGTACATTGATAGACTTGTCCTCGCCTCCCAGTTCACCAACATTCACAACAAGATCCAGCAGCGACGGTGTTAGTGTTGACAGTTTCGGTTCAGATGCAACAACATCAACAAATCATCAAAATGCATTTGGTGGTAATGCCTCCCAAGCTGAAAGCGGTTTTGAAGATGATTTTGATCTATTCCTTAATTCAAGAAAACCACTGCACAAAGATGACACCATTGATGATTTTGCAAATGTTGATCCATTTTCTCCATTACCCTGTAAAACTGTGACAAAGAAGACAGTCTTAAGTTCGGAGTCCTATGAGACATCTAGTATAACGAGTAGTCAATATACCGTGCCTTCTTTAAAAGGACCAACTATAATTCGTGCTAAGCCAGCAAGACCCAAACCTCCAGATAATTCGGCGTTGTTAAAGAGCACATTTGGAAGCGATTTCCATATGACCTCAATGAATGTTAATACCACAACACCAATTACCAAAATTAGTAATGGCTTCATACATAATGTGGAAACGAAACCAGACTTCACCTTCACCTGGGATTCATCACCGGAACGGGATTCATCACCACCGATGCCAACGATCCCCCCGCCCCCGCCGCCCGTCATCACCGATGAGGACCTGCCAGTTGTGTGGCCCGAAGACTTACTTGATGATGACGACGAACCATACGCGATAGCCCTGTTCGACTATCACACAGGCCATAGAGACGATTTGTCATTTACCGCTGACACCCGGATTACTCTGATAAGGCGAGAGAACGACGAGTGGATGTATGGTAGGTTGCAAGATGGCAGCGAAGGTCTTTTTCCATCCAATTACGTGGAGGTGAAGGTGCCGCTACCAAACGAACAGCCTAACAACATCGGCACGGCTATAGCGTTATACGACTTTGAGCCGATGCAGACCGGCGACCTTAGTTTCAGTGTGGGCGATAAAGTAACAGTTTTGTCCAAAATAAACGATGAATGGCATTACGGTGAGTGCAATGGCGTCAAAGGACAGTTTCCCGCCAATTACGTCCAAATGAGTTAG

Protein sequence:

>DPOGS204973-PA
MDLFKQNMKGPTRPAPAVPNMGVSRYSPVTNWNDDPFGADVFEPTPSYAQKKKPPPRPPPPKVSKHLDVPTKPSHFIRRPTVLSSILGRHNKNKVTQQNASQPILDVNSDVDTYKMFPKCDRANNQMGTLIDLSSPPSSPTFTTRSSSDGVSVDSFGSDATTSTNHQNAFGGNASQAESGFEDDFDLFLNSRKPLHKDDTIDDFANVDPFSPLPCKTVTKKTVLSSESYETSSITSSQYTVPSLKGPTIIRAKPARPKPPDNSALLKSTFGSDFHMTSMNVNTTTPITKISNGFIHNVETKPDFTFTWDSSPERDSSPPMPTIPPPPPPVITDEDLPVVWPEDLLDDDDEPYAIALFDYHTGHRDDLSFTADTRITLIRRENDEWMYGRLQDGSEGLFPSNYVEVKVPLPNEQPNNIGTAIALYDFEPMQTGDLSFSVGDKVTVLSKINDEWHYGECNGVKGQFPANYVQMS-