Monarch geneset OGS2.0

DPOGS207463
Transcript: DPOGS207463-TA  1140 bp
Protein: DPOGS207463-PA  379 aa
Genomic position: DPSCF300051 - 102681-106839
RNAseq coverage: 99x (Rank: top 61%)
Annotation
Heliconius: HMEL004674  6e-141  61.21%
Bombyx: BGIBMGA001205-TA  3e-72  49.02%
Drosophila: CG2183-PA  1e-29  28.11%
EBI UniRef50: UniRef50_Q17PP6  9e-29  28.38%  Putative uncharacterized protein n=2 Tax=Culicinae RepID=Q17PP6_AEDAE
NCBI RefSeq: XP_002032881.1  3e-32  30.60%  GM21013 [Drosophila sechellia]
NCBI nr blastp: gi|195332390  6e-31  30.60%  GM21013 [Drosophila sechellia]
NCBI nr blastx: gi|157127043  5e-28  27.60%  hypothetical protein AaeL_AAEL000289 [Aedes aegypti]
Group:
Gene Ontology: GO:0005515  8.3e-06  protein binding
KEGG pathway: gga:421924  2e-09
    K12460 (KIDINS220, ARMS) maps-> Neurotrophin signaling pathway
InterPro domain: [1-126] IPR020683  1.1e-25  Ankyrin repeat-containing domain
    [165-241] IPR010993  8.3e-06  Sterile alpha motif homology
Orthology group: MCL16355  Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207463-TA
ATGACACCTGTAATGATGGCCTGTTTGAACAGTACTTCAAACGAAGCAGCTGCTTACAATATTGTATCAAATCTCATACAAAGTAATTGTATGTTGAATATTGGAGATAAATATGGCGTTACACCTCTTATGAAGGCTGTAATTAGCGGCAAGCAATCTATAGTTGAACTCTTAGTGGATACAAATGTCAATATCGAAATGAGAGATAGGCAGGGTTGGACGGCAGTGTTTTGGGCTATCCATCACAACCGGCCAAAAGCTTTGGACTTGCTTCTAAAGAAGGGAGCTAGGCTGAATATAGTGGATATATCAAATCGAACCCCAGCCAAAATTGCATATTCTCATGACTACCTTCACATCCATTCAGTGATATCAGCGTATGAGAAAACCTGTGAGGATGACGATGAAACGATTGAGGAAAAGGAGATCAGCAGACAAAAAGGATTCCTGAGCAAATTGTCCTCATGGCATGATTTTTATCCAGGACTAAGGGATGAGAGCAAACCGAAGTTCGCTCATGAAATATCAAATCTTCTCTATGGTATGAATTGTGATAGACTCAGGGGTGTATTTGATAAGATAAAAATAAATTTAAGAGATTTTCTGCTCATGGAGGAAAAGGAAATGATAAAATATGGTGTTGATTTACCATTTGAGAGACAGAGGCTTAAACAAGGAATCCGTGGATTCCATTTGAGGAGTTGGAAGGTCAATTCCGTGGCTGGTCTACAAACAAGACGTGGTGACCCATACAGTATTGTTGAATGTCTCAGCATACTCGGCTCTCATTTGGAACAGCTGTACATATTGGAGTCAACACTAACATATGTTCTGAGAGATTTCAACAGAATACAAAGTAGATTGAAGTTTGAAGCACCCGACTCACCTGTCATGGTTAGACTGCAGCAGGCAGCCAGCAAGATGATCTGTAACATAAACAGTATCAGGAGAGAGGCGAATGCTATGAAAAAGATACATATTAAGATAAGTAAAGATAGCTTAAGACCCGTCGATCTCATAACGGAGAAGACAACCAAAGATGTAGCCGTAGAATTAATTACTGAACTGGTAGTGCTTAGCTGTATAGGCTTGCTTGTGTATAACGCTAGAAGCCTAGTTACTAAGATCATCGTCAAATAA

Protein sequence:

>DPOGS207463-PA
MTPVMMACLNSTSNEAAAYNIVSNLIQSNCMLNIGDKYGVTPLMKAVISGKQSIVELLVDTNVNIEMRDRQGWTAVFWAIHHNRPKALDLLLKKGARLNIVDISNRTPAKIAYSHDYLHIHSVISAYEKTCEDDDETIEEKEISRQKGFLSKLSSWHDFYPGLRDESKPKFAHEISNLLYGMNCDRLRGVFDKIKINLRDFLLMEEKEMIKYGVDLPFERQRLKQGIRGFHLRSWKVNSVAGLQTRRGDPYSIVECLSILGSHLEQLYILESTLTYVLRDFNRIQSRLKFEAPDSPVMVRLQQAASKMICNINSIRREANAMKKIHIKISKDSLRPVDLITEKTTKDVAVELITELVVLSCIGLLVYNARSLVTKIIVK-
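
The two lengths listed for this gene agree with each other: 1140 bp is 380 codons, which is 379 residues plus one stop codon, and the protein record writes that stop as a trailing "-". The sketch below shows one way to check this from the FASTA sequences. It assumes the standard genetic code (NCBI translation table 1); the names `CODON_TABLE` and `translate` are hypothetical helpers, not part of any MonarchBase tooling.

```python
from itertools import product

# Standard genetic code (assumed), laid out in the conventional TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as `stop_symbol` to match the trailing "-"
    used in the protein record. Ambiguous bases such as N are not handled
    and raise KeyError.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First ten codons of DPOGS207463-TA, copied from the nucleotide record.
prefix = "ATGACACCTGTAATGATGGCCTGTTTGAAC"
print(translate(prefix))  # MTPVMMACLN, the first ten residues of DPOGS207463-PA

# Length check: 1140 bp -> 380 codons -> 379 residues plus the stop.
print(1140 // 3 - 1)  # 379
```

Passing the full DPOGS207463-TA sequence to `translate` and comparing the result with the DPOGS207463-PA record extends the same check to the whole transcript.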