Monarch geneset OGS2.0

DPOGS206973
TranscriptDPOGS206973-TA1272 bp
ProteinDPOGS206973-PA423 aa
Genomic positionDPSCF300001 + 222806-225638
RNAseq coverage304x (Rank: top 37%)
Annotation
HeliconiusHMEL0143611e-11895.48% 
Bombyx% 
DrosophilaCG12581-PB4e-7374.56% 
EBI UniRef50UniRef50_E2A3716e-8242.97%Putative uncharacterized protein n=4 Tax=Formicidae RepID=E2A371_CAMFO
NCBI RefSeqXP_001810169.12e-8159.47%PREDICTED: similar to CG12581 CG12581-PA [Tribolium castaneum]
NCBI nr blastpgi|3320258705e-8444.66%hypothetical protein G5I_05417 [Acromyrmex echinatior]
NCBI nr blastxgi|1892386261e-10046.50%PREDICTED: similar to CG12581 CG12581-PA [Tribolium castaneum]
Group
Gene OntologyGO:00055155.1e-10protein binding
KEGG pathway 
InterPro domain[4-161] IPR0060205.1e-10Phosphotyrosine interaction domain
Orthology groupMCL16358 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206973-TA
ATGGAATCGACGCCGATCTGTCGGTGCCGTGTATTGTATCTGGGATCCGCGGTGCCTCAGCAGAGCAAGGACGGCCTACAGGGCATACAAGAGCCCTTGCGAGAACTGTATCCGGAAAAGGGGGCGACCAGTGGTGGAATTGATTCTTGGCTTTCCGTATGGTCAAACGGAATTTTGCTAGAGAATGTCGACGAAAGCGGCTCCAGAGTATCAAGGTTCTTCCCAATATCTAGTTTGCATTATTGCGCGGCCGTAAGACGTGTAAGTGTTGAGGGGCAACCTAGATTTTTACCTTTAGATTCACCTTTTGCTAGGGCGCCGGCTCCGAGGCGTCCGCCTTTATTTGCTGCTGTGCTCCGTCGCACACAGGGTATAAAAGTATTAGAATGTCATGCATTTATTTGTCGTCGAGAAGCGGCTGCAAATGCACTCGTCAGATGCTGCTTTCATGCTTACGCTGATAGCTCCTATGCAAAACGATTAGAAGCCGAGCGTTCCCCGTCCCAATTGCCACCAGAGGCCGATGAGGAGTTGGAAGTATACGATGGTGACGAAAATCATAAAGTTTGGGTTGGCGAAGTCGAGCGTGATGACGCAAGCGACGAAATGCCTCATCCGACACGGGCACCGCGACCGCGTCAAATTACAAGACCGGCCTCGGTGCCACCTCCGCCCCCTCCGCCAGAGGAGCCCAAGAAGAAATCCACCTCCAAGAAGTCTAAGAAGAAATCAGCGGCGTCCGTTGATGAGTTATACGCTACGATGCCAGGTCCGGCACCTGTCATGAATGGTAGGTCCTTGGGGAGGGCTCCTCCGGCCTGGGCGGCTGCCGGCCATCCTGGCCATCCCGGTCATCCAGGTCACCCTGGCCATCCTGGGCATATGGTATTGGTGGCGCACCCCGCAGCCACACTGCCTCATGTTCGACACGCAACCATGGGTCACAGGGGTCGGGTCCCAGCTGCTCTAGGACCATTCCCTCCCGTGCCCCCGCCTGGAAGAGGTCGCATACAATACGCTACAGTGGACCCGCGACGCTCGAAGCCTCCGCCCTCGGCTATGAAGGCGGCGCAAAGTATGAACGGCCTGGAAGCTGTCGAGGATTCCGGCGGTATATATAGAAAAAAAGGACATTTAAACGAAAGGGCCTTCTCTTATAGCATACGACAGGAACACAGAAGTCGTTCCCATGGATCTCTCGCCAATCTGAAATTCGCTGCTCCACCCGAGTTAAAATATCCCAAGGCAAACATCCAAGAACCCACGTCATAA

Protein sequence:

>DPOGS206973-PA
MESTPICRCRVLYLGSAVPQQSKDGLQGIQEPLRELYPEKGATSGGIDSWLSVWSNGILLENVDESGSRVSRFFPISSLHYCAAVRRVSVEGQPRFLPLDSPFARAPAPRRPPLFAAVLRRTQGIKVLECHAFICRREAAANALVRCCFHAYADSSYAKRLEAERSPSQLPPEADEELEVYDGDENHKVWVGEVERDDASDEMPHPTRAPRPRQITRPASVPPPPPPPEEPKKKSTSKKSKKKSAASVDELYATMPGPAPVMNGRSLGRAPPAWAAAGHPGHPGHPGHPGHPGHMVLVAHPAATLPHVRHATMGHRGRVPAALGPFPPVPPPGRGRIQYATVDPRRSKPPPSAMKAAQSMNGLEAVEDSGGIYRKKGHLNERAFSYSIRQEHRSRSHGSLANLKFAAPPELKYPKANIQEPTS-