Monarch geneset OGS2.0

DPOGS205779
Transcript: DPOGS205779-TA, 1296 bp
Protein: DPOGS205779-PA, 431 aa
Genomic position: DPSCF300144 - 356067-364186
RNAseq coverage: 1156x (Rank: top 11%)
Annotation
Heliconius: HMEL011578, 9e-135, 70.40%
Bombyx: BGIBMGA010576-TA, 1e-164, 78.59%
Drosophila: numb-PA, 2e-114, 58.59%
EBI UniRef50: UniRef50_E0VRS4, 1e-118, 64.10%, Putative uncharacterized protein n=1 Tax=Pediculus humanus corporis RepID=E0VRS4_PEDHC
NCBI RefSeq: XP_319339.4, 1e-125, 60.42%, AGAP010167-PA [Anopheles gambiae str. PEST]
NCBI nr blastp: gi|158299213, 2e-124, 60.42%, AGAP010167-PA [Anopheles gambiae str. PEST]
NCBI nr blastx: gi|345484977, 2e-120, 60.69%, PREDICTED: protein numb-like [Nasonia vitripennis]
Group
Gene Ontology: GO:0005515, 4.3e-49, protein binding
KEGG pathway: aga:AgaP_AGAP010167, 4e-125
 K06057 (NUMBL) maps-> Notch signaling pathway
InterPro domain: [71-215] IPR011993, 4.3e-49, Pleckstrin homology-type
 [84-214] IPR006020, 1.5e-40, Phosphotyrosine interaction domain
 [268-375] IPR010449, 7.4e-27, NUMB domain
Orthology group: MCL12215 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205779-TA
ATGGGCAACCAAGGTTCCTCGCCTCACGAACCGTTGGATCGAGTCCAGGTGCAAAATGGAGATCTAAAAATGAAGTCATCAGTCCGTAGCTCGTTGAGGAGGGCGCGGGCGGGGGCTACTCTGTCGCCGCCGCGCGCCGGCATGGAGCGGCTGAGGCGCTCCTTTAGGGAGTCCTTCCGCAGGAGGAAGGGCTCCCCGCCGGAGTCTGCGAGGCCCCACCAGTGGCATGCCGACGAGGCCGCGGTCAGGGCCGGGACATGCACCTTCCCAGTCAAGTATCTAGGATGTGTTGAGGTGTTCGAATCCAGAGGAATGCAGGTCTGCGAGGAAGCACTTAAAGTATTGAGAAATTCCCGTCGTCGGCCCGTCCGTGCTGTTCTTCATGTGAGCGGTGACGGACTCAGGGTTGTGGAGGAGGAGACCAAAGGCCTGATAGTCGATCAGACCATAGAGAAAGTCTCCTTCTGTGCTCCAGACAGGAACCACGAGAGGGGTTTTAGCTACATATGCCGTGACGGGACGACACGTCGCTGGATGTGTCACGGTTTCCTCGCGTCCCGGGACAGTGGGGAGCGTCTCTCTCACGCCGTGGGGTGCGCATTCGCAGCTTGCCTCGAAAGGAAACAGAGGAGAGACAAAGAGTGCGCCGTCTCCATGAGTATAGACGCCGCCAGCCACGCCTTCACCAGGCAGGGAAGCTTTAGGAAATCAGGTATAATCAGTCGTCGTACATCAGAGCCGGCAGAGGTGCCGCAGTCTCCAGGCAGCGTGACTACCCCGAGCTCCGGACGGGTCGCTCACAATCCGTTCGCGGTGGAGCGGCCCCATGCTGCACCACACCTGCTAGAGAGACAGGGCTCCTTCCGCGGTTTCGCTCATCTCAATAACAATTCGCCATTCAAGCGGCAGATGTCGCTACGTATATGTGAGCTGCCGTCCAACTTAGAGAGGCAGCGTCTCGGTCTCGGTTCGCCCTCCAACGGCGTCCCCGCCTTACCTGCGGCGCCGGCTGTACCCGCGCTGCCCACCCTACCCACACCCAGCCCTAAACCAGATGTAGCCGCTATTGAGGATAAGTCATCGGATCCGGTAGCGGAGATGTGTCAGCAGTTGTCTTTGGGTCTTCGCGCGTTGGCCGAGGAGCCGGTCCCAGCGGCGGCGGGGGCCCTCCCACACCCCGACGCCTGGCTGGGTCGGGTCGCACGAGCTCCCGCCCTTGCCAGCGCTGGCAGAGCTGCTTCTTTCGCTGGACGAGCCGCAACTAACCCCTTCATCACCGCCCCCGCGCCGCTATAG

Protein sequence:

>DPOGS205779-PA
MGNQGSSPHEPLDRVQVQNGDLKMKSSVRSSLRRARAGATLSPPRAGMERLRRSFRESFRRRKGSPPESARPHQWHADEAAVRAGTCTFPVKYLGCVEVFESRGMQVCEEALKVLRNSRRRPVRAVLHVSGDGLRVVEEETKGLIVDQTIEKVSFCAPDRNHERGFSYICRDGTTRRWMCHGFLASRDSGERLSHAVGCAFAACLERKQRRDKECAVSMSIDAASHAFTRQGSFRKSGIISRRTSEPAEVPQSPGSVTTPSSGRVAHNPFAVERPHAAPHLLERQGSFRGFAHLNNNSPFKRQMSLRICELPSNLERQRLGLGSPSNGVPALPAAPAVPALPTLPTPSPKPDVAAIEDKSSDPVAEMCQQLSLGLRALAEEPVPAAAGALPHPDAWLGRVARAPALASAGRAASFAGRAATNPFITAPAPL-