Monarch geneset OGS2.0

DPOGS210083
Transcript: DPOGS210083-TA, 1812 bp
Protein: DPOGS210083-PA, 603 aa
Genomic position: DPSCF300017 + 245688-255561
RNAseq coverage: 88x (Rank: top 63%)
Annotation
Heliconius       HMEL011789        6e-60   45.63%
Bombyx           BGIBMGA012735-TA  1e-122  49.14%
Drosophila       unc-5-PA          2e-43   42.79%
EBI UniRef50     UniRef50_F4W410   3e-47   43.51%  Netrin receptor UNC5C n=6 Tax=Formicidae RepID=F4W410_ACREC
NCBI RefSeq      XP_391817.3       3e-46   43.04%  PREDICTED: similar to unc-5 homolog B, partial [Apis mellifera]
NCBI nr blastp   gi|345483652      5e-48   47.21%  PREDICTED: LOW QUALITY PROTEIN: netrin receptor UNC5C-like [Nasonia vitripennis]
NCBI nr blastx   gi|270008123      3e-78   42.67%  unc-5 [Tribolium castaneum]
Group
KEGG pathway     ame:408264        9e-46
                 K07521 (UNC5) maps-> Axon guidance
InterPro domain  [178-259]  IPR013783  9.3e-16  Immunoglobulin-like fold
                 [183-259]  IPR003599  6.3e-10  Immunoglobulin subtype
                 [178-254]  IPR013098  6.3e-09  Immunoglobulin I-set
                 [189-247]  IPR003598  3.8e-08  Immunoglobulin subtype 2
Orthology group  MCL26004 (Lepidoptera specific)

Nucleotide sequence:

>DPOGS210083-TA
ATGAGTGTGCAAAATGTGATACGGATTATACTGGTGATGTGTGTTGTTGGAAAAGTTAGAAATGGAGGTCCGGAACAGACGACGGAAAAAGAGACAAGACCTGAGTACATACATCCAGACGGTTTACTTAACTTCGACCCTTATTTATCAGACTTTAAACCGGTTGAGGAAGAAGACGATTACATTCCACTACCAGAGAACGACGAGACCATAGACGGTCTGCCGATATTTCTCGTAGAACCAAAGAATTCCTTCGTTCTTCGCAGCAAACCAGCCACCTTGCTGTGCAGGGCCGCCAATGCTTTGCAGGTTTACTTCAAATGTAATGACGTGAGGACGGACAAAACGGTTCAGCTCGAACACGTAGACCCTCAGAACGGAGTGAGAGTGGTGGAAGCCGAACTGAATATCACAAGGAATGAATTGGATGAATATTTCGGAGGGAAATACAGTTGTGAGTGTTACGCGTGGAACAGCAAGGGCAGGATTCGGAGTCAAGGTGTCTTCATTGAGTTTGCTTATATCAAGAAGCAGTTCTCTCAGCAACCTCAGTCAGTGACGGCGGAGGCTGGACGGCAGGTCACTTTCCATTGCAGCCCTCCACCATCAGCACCTCCGGCAACCATCAAATGGATCCGCAATGGACTCACAATTGAACCAACCGACGAAACTCTGGTCTTGCCCAGAGTTGGAATACAGGATATGGCAAATTACACCTGCATAGCGGAAAATATCGCCGGTCGTCGGGAGTCCGATGTTGCAGTGCTATCAGTATACGGTAAGGAATGCCTGTTTTTTATTGAAGACATCTTTATCTACTCGCGTTCCTGTATTTCCTTTGCAAATATAGATTCAATGGCGGTTGGTCTGACTGGAGTCCCTGGATTCCTTGTCGGTGTGGATCCCAGACCAGTGGAAGACGGAGGACGAGGACCTGCACGGAACCAGCGCCAGCAAATGGTGGAGCACCCTGTAGAGGGCAGTCCGCTCAGAGTGACGATGACTGTCTCAGATGCCAAAGCGGTTCATGGTCCGCTTGGAACTCCTGGTCGTCTTGTAGTGAGGAATGCGTCAGGGTGCGCCGGAGACAATGTATCGGTACTTGTTCAGGATCCACTTTACAACACGCCGCCTGCGTTGACGGCTTATGCTCAGCTGGTCTCATGTGAAGTGTACTTAAAGCCTTTTTATGGCATCGGAGTGGCGTTTATCGATAAAAAGACAACAGTGGACTTTACTGCCTCTATAAGGTTGTACAAGAACAACATGAGCCTTTACGTGGGTATCGGCATTACGGTAGCGATGCTGGCCGCGGGTGCTGCTATAATGTACGTGTGGTGCCGCTATCGGACTGTCAGACCGGGATACAGTGCCGCAAGGACCGGTCAGTATAAACATTACCCTGATTTTGTAACCGGACTGTTAGAACGGGTGTACTCCCGCGACAAGGCCGCAGCTCCCGACCTCACTCGGACCTGCAATGAGTACTGGATGAGACCAGACAATCATTACGACATGCCGCAGATGAGAGACAGCTACGCGTCTCCGTTCGGTAACCGCAGTCACCAGCAGTCGTCTGCGGGTAGCAACCACTACGAGATGAAGCCCTACAGTCCCTCAGCCGAGTCCGCCTCCAGCTGTTACACTAATTCCCGCACGATGACGTCACGTTCGAGCGAGTGTTCATCCCAGTTGGCTGAGCTGGTGACATCATCGCCCACCTCCCACTCCAAGGTCGACGTAGCTGTCACTCTACCCCCAGGACCCGCGGAGAGCTATAGAGATAAAATATGTTTAACTATAGTTAAATAA

Protein sequence:

>DPOGS210083-PA
MSVQNVIRIILVMCVVGKVRNGGPEQTTEKETRPEYIHPDGLLNFDPYLSDFKPVEEEDDYIPLPENDETIDGLPIFLVEPKNSFVLRSKPATLLCRAANALQVYFKCNDVRTDKTVQLEHVDPQNGVRVVEAELNITRNELDEYFGGKYSCECYAWNSKGRIRSQGVFIEFAYIKKQFSQQPQSVTAEAGRQVTFHCSPPPSAPPATIKWIRNGLTIEPTDETLVLPRVGIQDMANYTCIAENIAGRRESDVAVLSVYGKECLFFIEDIFIYSRSCISFANIDSMAVGLTGVPGFLVGVDPRPVEDGGRGPARNQRQQMVEHPVEGSPLRVTMTVSDAKAVHGPLGTPGRLVVRNASGCAGDNVSVLVQDPLYNTPPALTAYAQLVSCEVYLKPFYGIGVAFIDKKTTVDFTASIRLYKNNMSLYVGIGITVAMLAAGAAIMYVWCRYRTVRPGYSAARTGQYKHYPDFVTGLLERVYSRDKAAAPDLTRTCNEYWMRPDNHYDMPQMRDSYASPFGNRSHQQSSAGSNHYEMKPYSPSAESASSCYTNSRTMTSRSSECSSQLAELVTSSPTSHSKVDVAVTLPPGPAESYRDKICLTIVK-
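
The two lengths in this record are consistent with each other: 1812 bp is 604 codons, which gives 603 residues plus one stop codon (written as "-" at the end of the protein record). The sketch below illustrates that check on the first and last codons of DPOGS210083-TA. It assumes the standard genetic code (NCBI translation table 1); the `translate` helper and the `head`/`tail` variable names are illustrative and not part of the database.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered over T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str, stop_symbol: str = "-") -> str:
    """Translate an in-frame coding sequence; stop codons become stop_symbol."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = (CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))
    return "".join(stop_symbol if aa == "*" else aa for aa in residues)


# First 18 nt and last 15 nt of the transcript shown above.
head = "ATGAGTGTGCAAAATGTG"
tail = "ACTATAGTTAAATAA"

print(translate(head))   # start of the protein record
print(translate(tail))   # end of the protein record, including the stop
print(1812 // 3 - 1)     # residues expected from an 1812 bp CDS with one stop
```

Passing the full nucleotide sequence to `translate` in the same way would allow a residue-by-residue comparison against DPOGS210083-PA.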