Monarch geneset OGS2.0

DPOGS211421
Transcript        DPOGS211421-TA  4317 bp
Protein           DPOGS211421-PA  1438 aa
Genomic position  DPSCF300115 + 402542-414833
RNAseq coverage   60x (Rank: top 68%)
Annotation
Heliconius      HMEL005676        0.0     81.79%
Bombyx          BGIBMGA010901-TA  2e-138  70.88%
Drosophila      unc-5-PA          4e-78   44.88%
EBI UniRef50    UniRef50_D6WJ91   8e-160  35.87%  Unc-5 n=2 Tax=Tribolium castaneum RepID=D6WJ91_TRICA
NCBI RefSeq     XP_391817.3       2e-144  34.41%  PREDICTED: similar to unc-5 homolog B, partial [Apis mellifera]
NCBI nr blastp  gi|270008123      3e-159  35.87%  unc-5 [Tribolium castaneum]
NCBI nr blastx  gi|270008123      3e-161  36.32%  unc-5 [Tribolium castaneum]
Group
Gene Ontology  GO:0007165  1.5e-07  signal transduction
               GO:0005515  1.5e-07  protein binding
KEGG pathway   ame:408264  5e-144
               K07521 (UNC5)  maps-> Axon guidance
InterPro domain  [1-404]      IPR023796  5.4e-89  Serpin domain
                 [1350-1434]  IPR011029  1.5e-18  DEATH-like
                 [617-706]    IPR013783  3.5e-16  Immunoglobulin-like fold
                 [1000-1091]  IPR000906  4.2e-16  ZU5
                 [617-696]    IPR013098  1.6e-10  Immunoglobulin I-set
                 [706-761]    IPR000884  3.2e-10  Thrombospondin, type 1 repeat
                 [622-705]    IPR003599  3.1e-09  Immunoglobulin subtype
                 [1358-1433]  IPR000488  1.5e-07  Death
Orthology group  MCL10182  Multiple-copy universal gene

Nucleotide sequence:

>DPOGS211421-TA
ATGAAATTATTGTTGTTAGTGGCATTTTTGATCTCGATATGGACGTGTACGATTGGGAAAAGACACAGAAAAAATAAATATCTTAAGTTTGATGGTAACGACACTGATGACTTCAATCAAGAACCGTCGTTTAATGACACGGATTCAGCCATTCATTTGGATTTAAAAGGAGATCTCAGGAAGTTATTCTTTGTGTTGAGCGCTGAGTCTAAACCGGAATCTAATACATTATGCTCGCCAATCTCAGCAATTTTGCCGCTTGCTAAATTGGCGTTGGGAGCTCAGGGGGACAGCTTGAAGGAGTTACTTTCAGCTATTGGTGTCCGCAAAAGAGAGATGATCAGTAAACAGTTCAAGCCTCTCCTGCTGCAGCTTCGTTACCTTCCCGGTGTAAGGCTCGATATCGCTAGTAGGCTGTACGTGTCCCAGAACGCTCGTCTTAACCGCAAGTTTGTAGATTTAGCACGAGACATATTTGAGAATAGTGCCGCCAAGATTGACTTCAATTTCCCAACGTACGTCGCCGAGGAGATAAACGCCTGGGTATCCTCACAAACCAATGGCATCATTAAAGACATGTTTGAGCCGACGGACATCTCACCGTCCTCTTCGTTGGTACTCGTCAATGCGGTGTATTTTAATGGTCGATGGGAAACTCCGTTTGAAAGTGTTAAAATTGGAACGTTTCACACAACGACCAGCCAGAAAACCATACAAATGATGTCTCTCAATGGAGAATTTAATTATACATCCAGTGAGGCTCTGGGATCTCAGGTAATCAGTATCCCGTACTCCGGAGGTCGCGCATCCCTGGTGCTGGTGTTGCCTTTGTCTCGGACCGGTCTGCCGCCTCTCTTGCACGCCCTCAGACTCGCGCCCTGGATGCTGCGGGCCGTCTTCGATGAGATGACCCACACGCGAGTACACATCACCATGCCTAAGTTCAGTGTCACCTCTGAGCTTGACCTCGCGACAGCATATAAAAAGTTAGGCCTCAAAACAATTTTCGACGCGAACCTCTCGGGCCTGCTGACGATAGTACGAGACGAGAAAATTTTCATTTCAAATGCGAAACACAAATGTTACATAGAAGTGAACGAGTATGGGACGGAGGCTGCCGCGGCTTCCGGTACGGTATTAATGAAGTTGTCTCCTTCCAAGCCGATAGATTTCCACGCTGACCACCCGTTTCTGTACTTCATAATGCAATTGAATGAAAAAGACAAATGCGATGCACACTCTCGGACAGGTGTCGTAATACCCGATGTAGATAGTAATGGCGACACGAACAGACCTTTAGACCTCCTAGCGAGTCCAAATGAAGCGAAAGACTACTTTCTACCCCCGGTTTTCCCCCACACTGATGACGTAGACGAGGATGATGACGAAGAAGAAGATCACTCTTATTTAGATTACGATTATGATCACAAGGAAATCAATTACAAAGATTCCCCTGATATAGTAACCGTTAAAAGCGACTATCAGGAGGTACTCCCGATAGATGATAAAATACCGTTGCACACAGCTAAAGATAACTTGCCAGTTTTTCTTTTGGAACCAGAAAATACCTACGTCGTTAAAAATAAGCCAGCAACGCTTAAGTGTCGTGCCGCTAATGCCTTAGAAGTTTATTTCAAATGCAATGGCATTAAAACCGAAGCTCTCAATTTTGAATTCGTGGATCCCCAAACCGGAGTCAGAATTATAGAGGGCGAGTATAAAGTAACGAGGGAACAGGTCGAGGAATACTTTGGCACAGAGAAGTACCAATGTTCCTGTTTCGCCTGGACGAGCCGCGGACAGATAAGGAGCCAACCAGCGACTATAGAACTCGCTTATATAAAGAAGCACTTCTCAGCATCTCCCCAGTCCCAGCTGGTGAAGCAGTATTCTGCTGTAACATTCCGTTGCGAGGCCCCTCCAGCTGCTCCCTTCGCCCAGGTCTACTGGCTGAAGAACGGAGCACCTATCGCCGTCGATGACAACGTGCACATTAACAA
AGAAGGAGATCTTGTGATCAAACAGGCCTCTCTTCTGGATATGGCGAACTACACGTGCGTCGCCGAGAACCTCGCCGGCAAACGGCTATCCGAGCCGGCTCTGCTCACAGTGTTTGTGAACGGCGGCTGGTCATCATGGTCGACTTGGTCTGAGTGTTCATGTGCTGGCGGAGGTCGTCGTCGAGAGAGAACCTGCACTCACCCTCGCCCCCTCAACGGCGGACAGCCTTGCTCTGGACCCGCGGTGCAACGAACACCTGAATGCACTCATTGTGAACTAGATGTCTACATGGATAGTTTGGATGAGTTGGCAGATGAATCACCTGATGTCGTGTTAGGTCACTGGTCACAGTGGTCGGAGTGGTCATCCTGCGACACAGACTGTCTTCGTTCCCGACGTCGTCGTTGTGTCTCTGGACCCTGTCGTGGCCGCGACGCACAACTGGCGCCCTGTCCGCTATGCGTACGGACTGCTAATGCGCATGCACATGGTGCATCATACTGGCCGCTCCTGATTGCCTTATCGATAGCTTTTCTAATATTTGTTGTTGTTGTTGTACTCGGCATTAAGTATATGAAGATGAAAATTGCTGAAAACTCTCCCTACGTAAAGCCTCCTCCGGGTACGAATTACTTCGGTAACGTTATAAAAAGGACTCTTACTAATCAACCGGACTTGACGATACACGAGGAGTTCCACACAATGGATTCAAGACGAACAAACAGACAGACGAACAGCATCAATAATAGAAACGAACACTTGTATGAAGTCCCGCAATTGGCTAACAGCTACATCGCGCCATTAGATCACGAGGTAAGAAACCATGGAGTGGAGACATTCGCTTGCAAGGACAAGGAGTCAGACCGCTCTGACTCCAGCTGTTTCTTGAGCTCGGGTTCGTCGTATGGAAATGAAAGTGTAGAAATGTCACCTTCGCTCAAAAATAACGCGTCTGTTAATTCGAAATTTTTAAATTGTCAATATTTAGAGACTGCTACATCCAAAATTGTAAACGGTGACGGCGATTGGCTAAATTTGGATAAATGTGGGGTCAGATTATATATTCCGGACGGCGTGGTTGAGAAAGGCGAAGAGTTATTCTCTATTGAGGTCACAGACGAAGAGTGGAACAAACCGCTTCTACAAGAAGGTGAAACACAACTTAGTCCAATAATTCGATGCGGGCCGAAGAATGTCCATTTCTGCAAGGCAATGATACTCACATTCCCTCATTGCGTAGCCTTGAAGAACTCTAGCTGGATACTGTCAATACTTCAGAAGCCCGAAGAAATCAACTGTAGAGAATGGAGGAAGGTCCTAACATTGGGGCAGGAAACCCCGGGAAGCCCCATATTCGCTCAAGTCGATCCTTTAAAAGTGTACATCGTGTGCGAATTTCTCAGTGACTTCGTTTTAATCGGTAGAAGCTTTAATGCTCTAGATGCCAAATTGTTAAAAGTAGCTCTTTTTCTTGGTAAACGTTCTGACGGCTACTACAACCTGCACGTCCACGCGTTTCATGACACTCCTTACGCCTTGTACGAATGTTTCGAAAGCGAGAGACGTACGGGCGGGCTTCTTCTCTGTGATCCGAAAACTATTTATTTTCAAGAGAATTGCTCTGATCTGTGTGTCAATGTTCGGGATGTGGGAGTGGGATGGAAGATCCAATCAGGAAATAAATATCAGGAGCTTCCGTTTGCCCAATTATGGAATATGAATGTCCGATCTCTGCATTGCCTTTTTGTTTTACAACAGGTGGATGCTATAAATTGTTTAGACCTAAACGTCACGGTTTATCAAAAACAAAGGCAGTCGAATTCGGTAAATTTCAATTTGAAAACAAACGATTTTAATTCGAATCTGAATTGTAATAAACGGTTCAGCTATTCGGGCGTAGAGATCGGTTACAGCGTTAATATCCTCGAGAATCCGTTCAATTATTCATCGTTATCGAAAAAAGAGGCCAGATATGATTTCGTTTACAGAAACAGTTTGC
GTAGGAACAGTTACAGCTTGGATAATAATTGCTACAGAACAAACGTTCTGACCAAGTCTGACAGGGTGATACTCTGCAAGTTGTTGGATTGTCAGACGACTAAAGGTAACGACTGGAGGTTGCTCGCGGAAAAATTAAAGCTAACGTCCTTCTATTACTATTTCTCTAACACTTGCTCTCCGACTGAGAATATATTGAATCTTTGGTTATGCAGAAACAACGATGTCAATATGTTAATAAGTTTATCGCGGGTGTTTAGGGAGATGTCTCGGATAGACTGCGCGACTGTTATCGAAAGGCGCTGTTTTCCTAATTAG

Protein sequence:

>DPOGS211421-PA
MKLLLLVAFLISIWTCTIGKRHRKNKYLKFDGNDTDDFNQEPSFNDTDSAIHLDLKGDLRKLFFVLSAESKPESNTLCSPISAILPLAKLALGAQGDSLKELLSAIGVRKREMISKQFKPLLLQLRYLPGVRLDIASRLYVSQNARLNRKFVDLARDIFENSAAKIDFNFPTYVAEEINAWVSSQTNGIIKDMFEPTDISPSSSLVLVNAVYFNGRWETPFESVKIGTFHTTTSQKTIQMMSLNGEFNYTSSEALGSQVISIPYSGGRASLVLVLPLSRTGLPPLLHALRLAPWMLRAVFDEMTHTRVHITMPKFSVTSELDLATAYKKLGLKTIFDANLSGLLTIVRDEKIFISNAKHKCYIEVNEYGTEAAAASGTVLMKLSPSKPIDFHADHPFLYFIMQLNEKDKCDAHSRTGVVIPDVDSNGDTNRPLDLLASPNEAKDYFLPPVFPHTDDVDEDDDEEEDHSYLDYDYDHKEINYKDSPDIVTVKSDYQEVLPIDDKIPLHTAKDNLPVFLLEPENTYVVKNKPATLKCRAANALEVYFKCNGIKTEALNFEFVDPQTGVRIIEGEYKVTREQVEEYFGTEKYQCSCFAWTSRGQIRSQPATIELAYIKKHFSASPQSQLVKQYSAVTFRCEAPPAAPFAQVYWLKNGAPIAVDDNVHINKEGDLVIKQASLLDMANYTCVAENLAGKRLSEPALLTVFVNGGWSSWSTWSECSCAGGGRRRERTCTHPRPLNGGQPCSGPAVQRTPECTHCELDVYMDSLDELADESPDVVLGHWSQWSEWSSCDTDCLRSRRRRCVSGPCRGRDAQLAPCPLCVRTANAHAHGASYWPLLIALSIAFLIFVVVVVLGIKYMKMKIAENSPYVKPPPGTNYFGNVIKRTLTNQPDLTIHEEFHTMDSRRTNRQTNSINNRNEHLYEVPQLANSYIAPLDHEVRNHGVETFACKDKESDRSDSSCFLSSGSSYGNESVEMSPSLKNNASVNSKFLNCQYLETATSKIVNGDGDWLNLDKCGVRLYIPDGVVEKGEELFSIEVTDEEWNKPLLQEGETQLSPIIRCGPKNVHFCKAMILTFPHCVALKNSSWILSILQKPEEINCREWRKVLTLGQETPGSPIFAQVDPLKVYIVCEFLSDFVLIGRSFNALDAKLLKVALFLGKRSDGYYNLHVHAFHDTPYALYECFESERRTGGLLLCDPKTIYFQENCSDLCVNVRDVGVGWKIQSGNKYQELPFAQLWNMNVRSLHCLFVLQQVDAINCLDLNVTVYQKQRQSNSVNFNLKTNDFNSNLNCNKRFSYSGVEIGYSVNILENPFNYSSLSKKEARYDFVYRNSLRRNSYSLDNNCYRTNVLTKSDRVILCKLLDCQTTKGNDWRLLAEKLKLTSFYYYFSNTCSPTENILNLWLCRNNDVNMLISLSRVFREMSRIDCATVIERRCFPN-