Monarch geneset OGS2.0

DPOGS214954
TranscriptDPOGS214954-TA1743 bp
ProteinDPOGS214954-PA580 aa
Genomic positionDPSCF300280 + 126523-129766
RNAseq coverage411x (Rank: top 30%)
Annotation
HeliconiusHMEL0155952e-12673.33% 
BombyxBGIBMGA004851-TA1e-16863.79% 
DrosophilaH-PA1e-2749.38% 
EBI UniRef50UniRef50_Q16Y992e-3834.16%Putative uncharacterized protein n=2 Tax=Culicinae RepID=Q16Y99_AEDAE
NCBI RefSeqXP_001653350.13e-3934.16%hypothetical protein AaeL_AAEL008617 [Aedes aegypti]
NCBI nr blastpgi|1571193086e-3834.16%hypothetical protein AaeL_AAEL008617 [Aedes aegypti]
NCBI nr blastxgi|1571193081e-4631.98%hypothetical protein AaeL_AAEL008617 [Aedes aegypti]
Group
KEGG pathwaydme:Dmel_CG54609e-26 
 K06064 (HAIRLESS)maps-> Notch signaling pathway
Orthology groupMCL20352 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214954-TA
ATGACAGAAGAAGTAGACAGAAGACATGGTGTGAATGGGAACAGCAGTAGTAAGAGTGCTCATGAGGATCCTCCTGTGCAGAGTCCAGCTCCCGCTACGACGGGTGGGAGGCTCAAGTTTTTTAAAGATGGAAAGTTTATATTGGAACTGGTCCGAGGCAGCGCGCGCGAGGGTGAGCGCGCTGGGTGGGTGTCGGTGCCTCGTAAGACGTTCTGGCCGCCGGCCGCCGCCCCGCCGACGCCTCCTCACGCGGCGCCGCCCTGCGCCTCGTTGTCGCTGTCAGACGACAATTCCTCGCTGCACTCCTCGCCCTGCACTTCGCACCGCGACCACTGCTGGAAGCAGCCGACGCCGCGCCGCAACCTGTCCAAAGAGTTAGCCATGTACTACTGCAGGCCCGCTACCCTGCACACCGCTCATATCGCCACCGCCGCACGCCTCAAGCGAAGGAGACCCTTCGACACCGGCTACCATGAGATCCTCACCAACGGAGCGCTCCGGAAGACCTGTGTGAACGGCTCCTGTGACAAAAAACGAAACGGTGACGTCGGCTCGCCCGAGAACAAGCATTGTAAAGGAGATGACGTCGTCGACGGGCCGACGGAGGCCAAGACGAAGACGGACGAGGACAGGTGGACGGACTTCGACAGGGAGAAATACTTTCATATGAAACTCAAAAAACCATATCAGTATCATAAGTTAAGGGTGTACAAAAAGACGAGGCCGGCGTTACGACGCAAAGAGTTGGCCCGGGTCTTAGAAAGACTCCGAGAGAAAATTCTGTCGTTGCCCGTGCCCGTGAACGCCAAACTGGCAAACTGCAGGCAGGAGCACATGATGGTATCTCCGAGGAAGCGGATCCTGCGCGAGATGGAGCGGGTCAGTCTCGAAGACCAGGCGACCAAGCGGCGAGCGAAGACGGTCCCCGCTCTTAGCACGGCCTCCTACCCGCCCTCCCCCGGCCCCAGTCATACACAGCACCGCGGATCAGACGGGCCAGCGCGACTGTCCAACGGCACGGCTCCCAGGAAGGAGACCGCGGTGTCCAAGAACGTCAGCAGCTACAGCATACACTCGTTGCTCAGCATGCCTGATGAGAGTCCTACGCGCCGCTCGCCCGAGGCCAAGCGTTCACCGCATTCCTACCCACCGTCATTGAAGACGGAGTCTCCTTCGAGCGTCAACTCGCCCGATTTGAGCCCCAGTCCAGACAGCTACCGGTACAGGTACTCGACGCTCTCGCTGGGGTCCCCGGGTCGCGGGGCGGCGCGTGATTCGCCCACGCCGCCACAGCCCACCAACCCGCCTTCCTTCCGTGCATATGCACCGCCCACGTCCCCGTACAGTGGTCGTCCGGGGCCCGCGTGGCCTGCTCCGCCGCCGCCGGGGCCCTTCAGACGAGATGAGTGGGCTGGCCCTGGGGCTGTGAGCGGCATGAGCAGCGCACAGTATGTGTTCGGGTATGGATACGCTCCACACGTGTACCGCGCCGCGCCTGCGCCCCCTCTGTGGATGCATTACGCGTTGGCCCCGGGTGCTCCTCCCGGGCCGTGGGCGCCTCTCGCTCATCCCCTGCTCACAGACCACATACCCAAGGAGGAGCCCACGTCCGGTCAATTTGCCGTTAAACTTATCAAAACATTGAAGCCGACGTCAATTGTGACGTGTGACGTCACGCGTCCGTCTCGGCGGGTCGTCTACGAGTCGAGCCCGTCACGCGACACGTCACTGGCGGGAGGGTAG

Protein sequence:

>DPOGS214954-PA
MTEEVDRRHGVNGNSSSKSAHEDPPVQSPAPATTGGRLKFFKDGKFILELVRGSAREGERAGWVSVPRKTFWPPAAAPPTPPHAAPPCASLSLSDDNSSLHSSPCTSHRDHCWKQPTPRRNLSKELAMYYCRPATLHTAHIATAARLKRRRPFDTGYHEILTNGALRKTCVNGSCDKKRNGDVGSPENKHCKGDDVVDGPTEAKTKTDEDRWTDFDREKYFHMKLKKPYQYHKLRVYKKTRPALRRKELARVLERLREKILSLPVPVNAKLANCRQEHMMVSPRKRILREMERVSLEDQATKRRAKTVPALSTASYPPSPGPSHTQHRGSDGPARLSNGTAPRKETAVSKNVSSYSIHSLLSMPDESPTRRSPEAKRSPHSYPPSLKTESPSSVNSPDLSPSPDSYRYRYSTLSLGSPGRGAARDSPTPPQPTNPPSFRAYAPPTSPYSGRPGPAWPAPPPPGPFRRDEWAGPGAVSGMSSAQYVFGYGYAPHVYRAAPAPPLWMHYALAPGAPPGPWAPLAHPLLTDHIPKEEPTSGQFAVKLIKTLKPTSIVTCDVTRPSRRVVYESSPSRDTSLAGG-