Monarch geneset OGS2.0

DPOGS204430
Transcript: DPOGS204430-TA, 2136 bp
Protein: DPOGS204430-PA, 711 aa
Genomic position: DPSCF300002 - 352049-357225
RNAseq coverage: 464x (Rank: top 27%)
Annotation
Heliconius       HMEL006239            5e-156   65.91%
Bombyx           BGIBMGA007727-TA      8e-164   56.71%
Drosophila       nct-PA                3e-99    33.06%
EBI UniRef50     UniRef50_E2BHJ1       8e-102   34.45%   Nicastrin n=1 Tax=Harpegnathos saltator RepID=E2BHJ1_HARSA
NCBI RefSeq      XP_971861.2           7e-108   35.58%   PREDICTED: similar to AGAP003323-PA [Tribolium castaneum]
NCBI nr blastp   gi|189238310          1e-106   35.58%   PREDICTED: similar to AGAP003323-PA [Tribolium castaneum]
NCBI nr blastx   gi|189238310          1e-104   35.06%   PREDICTED: similar to AGAP003323-PA [Tribolium castaneum]
Group
Gene Ontology    GO:0016485            9.8e-87  protein processing
                 GO:0016021            9.8e-87  integral to membrane
KEGG pathway     cqu:CpipJ_CPIJ001617  5e-107
                 K06171 (NCSTN)        maps-> Alzheimer's disease
                                                Notch signaling pathway
InterPro domain  [9-663] IPR008710     9.8e-87  Nicastrin
Orthology group  MCL13755              Single-copy universal gene

Nucleotide sequence:

>DPOGS204430-TA
ATGAGTTCTTTAAATTTTTTTATGTGGTTTTTAGTAATATTGCTATCAAAAGATTCTGCCTGTGAAAGGCTTCATGAACAGATTTATTCATCTATTGAAGGTGGTGCTGCCTGTTTCAGACGATTGAATGGTACTCACCAAGCAGGATGTACATCCCCCTTGAATGGTGCAGTGGGTGCAGTTCATATGATTTCCAATATATCAGATGCCCAATGGATGATATACAACTCAAGTGCTGGTCCGTATGTGGCCGTTGTCAATACAATAGTTTTCAATGAGGTCATAGAACTTTTTATGCAGGAACCATCAAATGTGGCTGGTATTTTATTATATGAAAATGCTACTGAAAGACCTGAATTCTTCAGTCCGGAAACTCAATGTCCGAATGAAAACTCAGCTGGCTCTGATGGCCAGTGTGCTGGTGATGTTGTATGGAATGAGAATGGTTCTGGTCTTTTGAGGAGAGATATACCTTTTCCAATATTCTTTCTGCCGAGTTCTAGAATCCAAGAAATTGATAAAATTATACAATGTCATGATAGATATAATCTTGATAAGGATAATCAAAAAGGCAGGCCGTTGTGTTCCTTACAATTAAGTTCATTTATGTATGCTGCTGTTAATACCGCTGTGTGTTTAAGGAGGTCAGCAACTTCAATATTTGGAATAGCTATTAAGATGTGTGACCCACTTGGAGATTACAATGTATACTATTCTTTATTCCCACGGCCAAAGGAAACAGCAAAAGAAAAGAAACAGGTAATTTTAGTGACAGCAAGAATGGATTCAGCATCACTCTTTGATGGTGTGGCACCTGGAGCAGCTAGTGCAGTTGTCGGATTGGTAACACTAATAACTACAGCGGCCACACTGTCACAAATGATACCTCAAACTGAAGCCGCTTCATATAATAAAAATGTTCTCTTCACATTGTTTAATGGCGAGTCATATGATTATATTGGATCTCAGAGGGTGGCGTACGATATATCTCAGGGGGTGTGGCCGCCGTTATCTCCCATCACTTCCAAGGATATCAACCTACATGTGGAATTGGGACAATTAGGTGGCTCCTTGAATTTGTTCAAAGACAACCCCAATTGGCCGCTGTACGCGTACGCGCCTTATACATATACTATACCGCCTCAAGTTACAGAGTTTCTTGCTGAGATGTCTTCATACTCTCAGACCAATAATATGACCATCGAATCAGAGTTTTCTATAAACATGCCACCATCTTCCCTACATTCGTTTAGGAAAATTCTTTCCAATGCGACTGAGAGCGGGGAACTACCGGAGATATTGTTAGTCGACCACTTAGGGAAGTTTACGAATAAATACTATGAATCGGCATTAGATGACTATGATAGCATTGGTTATTCGTACCACAATATTAGCATCAGTAATGATGGAAAATTTATACCAACGGACGATTTAATAGCGAACGGCACTATGTCAGAGAATGAGGCGCAAGTTAAGATAGCTCGTGTGTCAACATCACTTGCACACACGTTGTACCAACAGATCGTTGGAACCGCGTACGCTGGAAATATTTCATCATCCGCACATTTGGTGGACGAAATGCTGTACTGTTTCCTCCGAAGCCAGGCTTGCAGGTTGTTGGTGGCTGCGGACTACTACAGTAACGAGGACACGCCCCCTGACGATAGGCCCGCGCCCCTGTATGTGGGGGTCGCTGCTCTGTCCACACCAGCCGCGCTGTACTCCGGCCATCTCCTGGCATTGCTCACGGGAACCCACATACAAGTCAATAGGACCGCCTGCGATAACATCGGCACTCCGGGCTTCTCATATTACTATCTTCGAGGTTGGAATCAAAGTGGTATTTGTATACAAACAACAATGAACTTTAGTCAAGCTATTAGTCCAGCGTTTATTAAACCAGATTATAACATAACATCCGGGGAGTTCTCTACATGGACGGAGTCTGTGTGGCTTGGTCTGTGGGCGCGTGTGTTTGTTCGTGCAGCTGGTGTGGGCGC
CCGTGTGGCGGCCGCGGCCGGAGCCTTCACAACCATACTAGCGGCCGTCACCACCTACTGGCTGCAAAGACACGCCAGTGTTATATTCCTGACGCCTGTCGCGGATAATGGGGTGATCAGAAGTGTAAATTGTTAA

Protein sequence:

>DPOGS204430-PA
MSSLNFFMWFLVILLSKDSACERLHEQIYSSIEGGAACFRRLNGTHQAGCTSPLNGAVGAVHMISNISDAQWMIYNSSAGPYVAVVNTIVFNEVIELFMQEPSNVAGILLYENATERPEFFSPETQCPNENSAGSDGQCAGDVVWNENGSGLLRRDIPFPIFFLPSSRIQEIDKIIQCHDRYNLDKDNQKGRPLCSLQLSSFMYAAVNTAVCLRRSATSIFGIAIKMCDPLGDYNVYYSLFPRPKETAKEKKQVILVTARMDSASLFDGVAPGAASAVVGLVTLITTAATLSQMIPQTEAASYNKNVLFTLFNGESYDYIGSQRVAYDISQGVWPPLSPITSKDINLHVELGQLGGSLNLFKDNPNWPLYAYAPYTYTIPPQVTEFLAEMSSYSQTNNMTIESEFSINMPPSSLHSFRKILSNATESGELPEILLVDHLGKFTNKYYESALDDYDSIGYSYHNISISNDGKFIPTDDLIANGTMSENEAQVKIARVSTSLAHTLYQQIVGTAYAGNISSSAHLVDEMLYCFLRSQACRLLVAADYYSNEDTPPDDRPAPLYVGVAALSTPAALYSGHLLALLTGTHIQVNRTACDNIGTPGFSYYYLRGWNQSGICIQTTMNFSQAISPAFIKPDYNITSGEFSTWTESVWLGLWARVFVRAAGVGARVAAAAGAFTTILAAVTTYWLQRHASVIFLTPVADNGVIRSVNC-
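
The transcript and protein lengths listed in this record (2136 bp and 711 aa) can be cross-checked with simple codon arithmetic. The sketch below is illustrative only: the function name `expected_protein_length` is hypothetical, and it assumes the transcript consists solely of coding sequence that ends in a stop codon, which is what the ATG start and TAA end of the nucleotide sequence suggest.

```python
def expected_protein_length(cds_bp: int, includes_stop: bool = True) -> int:
    """Return the number of amino acids encoded by a coding sequence.

    Assumption: cds_bp counts only coding bases (no UTRs or introns).
    If includes_stop is True, the final codon is a stop codon and is
    not counted as an amino acid.
    """
    if cds_bp <= 0 or cds_bp % 3 != 0:
        raise ValueError("coding sequence length must be a positive multiple of 3")
    codons = cds_bp // 3
    return codons - 1 if includes_stop else codons


# Lengths as listed in the record for DPOGS204430-TA / DPOGS204430-PA.
transcript_bp = 2136
protein_aa = 711

# 2136 / 3 = 712 codons; minus one stop codon leaves 711 residues.
print(expected_protein_length(transcript_bp) == protein_aa)
```

The trailing "-" in the protein sequence marks the stop position and is not counted among the 711 residues.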