Monarch geneset OGS2.0

DPOGS210087
Transcript: DPOGS210087-TA (3051 bp)
Protein: DPOGS210087-PA (1016 aa)
Genomic position: DPSCF300017 + 385600-412304
RNAseq coverage: 214x (Rank: top 46%)

Annotation
  Heliconius      HMEL011789        0.0      92.36%
  Bombyx          BGIBMGA012668-TA  0.0      86.61%
  Drosophila      unc-5-PA          3e-135   31.39%
  EBI UniRef50    UniRef50_D6WJ91   0.0      44.65%   Unc-5 n=2 Tax=Tribolium castaneum RepID=D6WJ91_TRICA
  NCBI RefSeq     XP_391817.3       0.0      40.33%   PREDICTED: similar to unc-5 homolog B, partial [Apis mellifera]
  NCBI nr blastp  gi|270008123      0.0      44.65%   unc-5 [Tribolium castaneum]
  NCBI nr blastx  gi|270008123      0.0      44.65%   unc-5 [Tribolium castaneum]

Group

Gene Ontology
  GO:0007165  1.8e-14  signal transduction
  GO:0005515  1.8e-14  protein binding

KEGG pathway
  ame:408264  0.0
  K07521 (UNC5) maps-> Axon guidance

InterPro domain
  [924-1010]  IPR011029  3.4e-29  DEATH-like
  [574-667]   IPR000906  2.4e-20  ZU5
  [216-305]   IPR013783  2e-19    Immunoglobulin-like fold
  [922-1013]  IPR000488  1.8e-14  Death
  [216-303]   IPR013098  2.4e-14  Immunoglobulin I-set
  [227-292]   IPR003598  4.6e-11  Immunoglobulin subtype 2
  [221-304]   IPR003599  2.1e-10  Immunoglobulin subtype
  [302-359]   IPR000884  5.4e-09  Thrombospondin, type 1 repeat

Orthology group: MCL10182 (Multiple-copy universal gene)

Nucleotide sequence:

>DPOGS210087-TA
ATGGCGCGCGGGAGCGTGCTCCTCCTGCTATGGGTGGTCGCCGTCCTAGCGGATAAACATGACGAACAAAGCTCCACTGAGCCTGAGCATACAACAAGTGCTCAAACACCGCCGATTCCATTGCGACCGGAGTATACACACCCAGCTGGCGTTGAAGCATTAGATCCATATAACTCAGAAAAGAATAATCCTGTACCTCACCATCGCGATTACTTGGAAGAAGATTATGATCACGAAACTCGAAAGGAAGATGAACATGAGAACAGGGAGGATGCAGACGAACTACCCTCTTTATCTGGAGATCACCTTCTTCACAGTTCAACAGATGATCTTCCTATGTTTCTATTGGAACCTCAAAACGCGTATGTGGTCCGTAATAAGCCGGCTATGTTGAGGTGTAGAGCGGCAAATGCTCTTCAAGTGTATTTTAAGTGTAATGAGGTGCGGTCAGTTGGGAGCACGCAATTTGAATTTGTGGACCCACAGAATGGTATTCGAATCGTAGAAGCCGAATGCAACGTGACAAGAGATCATTTAGAGGAATATTTTGGTGAGGACAGATTTTCATGCACGTGCCACGCTTGGAGTAGCCGCGGTGACATAAAAAGTCAGCCAGCTATAGTTGAGCTTGCTTATCTGAAAAAACAATTTGTGCTGTCACCGTCTCCAACATCTGTGGAGGCTGGGTCTGCAGCGTCACTGCGCTGTTCACCTCCGGCTGGGGCTCCGACTCCAAGGATCTCATGGCTAAAGCACGGTATGCCGCTACAGCCTGACCACAACGTACTGATATCTGCCGAAGGAAACCTACTAATAACGAGGGCTACACAACAAGATATGGCAAACTACAGCTGTGTTGCGGAAAACATCGCCGGAAAAAGAGTATCGGAAGCTGCCACATTGACCGTTTATGTTAATGGCGGTTGGAGTTCATGGGGCCCGTGGACACAATGTCGCTGTAACGGTCACATAGCTGGCCAGCGACGCACGAGATCTTGTACGGAACCCCATCCCCTCAATGGCGGTGCACCCTGCCAAGGTCAAAGTGTACAAAAAACTGCCGACTGCGTTCTATGCCAGAGCGCATCACCTGGTCGCTGGGGAGTTTGGACGGAATGGTCAGTCTGTGGTTCAGACTGTAGACAAACTAGAAGAAGATCTTGTGCTGGAGACGCTTGCTCCGGCGCTCAGGTACAACATGCTGACTGCGTCGGCGACTACTGCGGAGTTCACGGAGTCGCCGCGCACAGTGACATCCCTCTGTACATTGGTATAGCGGTGGCTGTGGTGGTATTCCTTGCGGGAGCGGCTGTCGTCTACAAACTCTTACAGAGAAAAACTCGGGACCATTCTCTGTACACTATGACAAGAACTGATTTCCAACCAGAAATTTACCCGACAGTAGAGAAGCAATCTCTATCCCTAGCGCCAGACCTAATGCAGCATAGACCACCTCGTTATGAACACCCTCAGCCGGATCCAAGAACTGAACATCATTACGATGTGCCACATCTGACCAATAGTTACGCGTCGCCTGTCGACCATCAAGTGACTCCATGCGCCTCAAGCAAGGGCCAGAGCGATGAATATGACAGCAAACCATACACTGAATCCGAACATTCAGCCTCCAGTTGTTTTACTAGTTCTGGTTCGATGTACGATGCAGCGAACGAATCAGTGACGCTGCAACTAGCGGAACTCGTATCAAACTCACCAATATCTCAGTCACTTTCCGTATCGAGTTCCGGGGCAAGACTGGCTTTACCCGCTGCCGGTGTCGCTTTATCGATACCAGAAGGTGCAGTTAGTAGGGGACGACGTGAACAAATTTATGTAGCCGTAGTCAAAGACGACAGATATAGATCGAGGCTCGGTAAAGGTATCACGCAACTCAGTCCAGTAGTTAAATGCGGCCCGCCTCGTCTTCAGCTTAATAAGTCAGTTATCCTTCAAATACCTCACTGCGCTAGTCTTAAACACGGCTTCTGGAATCTTGCCCT
GTACGCCATAGATCATAATAATGCGAAAGCCGATACACACCCCCAATGGAAGAAGGTCGTTAGTCTCGGACAAGAAACTATAAATACACCAGTTTTCACTCAATTGGACAGCGAAAAGATATACCTCGTTACGGACATGCTTTCAACATTTGTCCTTGTCGGTGAAAGTTTTAATGGGAAGGCAGTTAAAACACTACAATTAGCTATTTACGCACCTGCTATGATAACGGATTCTTGTACGGAATTTAATATACGTCTGTATATTTTCGAAGACACTCCATGTGCTCCTTACTACTGTCAGGAACAAGAAAAGAAATTGGGCGGAGTTTTACTCGAACGACCGAAAACTCTTTTATTCCAAGACGGAGGTTCGAATTTGTGTTTGAATTTAGAACATTTGAGTCCCGGCTGGAAAGCTAAACCGGGAATCGGATACCAGGAAATCCCTTTCAACCATGTATGGGGATCGAATTACAATGCACTGCACTGTAGTTTTGCATTAGAAAGGACTTACGACTGCGATAACATCGACTTTACAATCTCCGCTTGTCAAAGAACCAATCCATCTTATAAACAGACGTTCCGAATATTAATAGACGATCTACGTAGCCGCTCACCCCGTGGTCGTTCGCCCCGTGAATCTCAAAGGAGTTTTAATATTACTGAAAATCTTCTAGAGGCTAGAATGTGCCGAAGTGATGAAGGCAGCGATGGCAATCTCAAAAGGAATGTGACCGTCAATGAAACGGGTGTAAACGAGTGCGCTGAAAGTCCATTCAAATTAAGTGTAAGGGTCAAACGTCAGTTATGCGCTATATTAGACCCTCCGAACGCGCGTGGCAACGACTGGCGCGCGCTTGCGGCTCGCCTCGGAGTCGATCGCTACACGACATGGTTTGCCACTAAGAGCTCACCCACTGAGGCTATCCTGGAACTATGGGAGTGCAGAGAACGCGGTCCCGGCGCAACGGCTTCCCTCGCCGCCGCACTGCGTCAGATGGAACGACACGACGCAGCTGACCTACTACACGGAAGACCTTCTTGGTTATGA

Protein sequence:

>DPOGS210087-PA
MARGSVLLLLWVVAVLADKHDEQSSTEPEHTTSAQTPPIPLRPEYTHPAGVEALDPYNSEKNNPVPHHRDYLEEDYDHETRKEDEHENREDADELPSLSGDHLLHSSTDDLPMFLLEPQNAYVVRNKPAMLRCRAANALQVYFKCNEVRSVGSTQFEFVDPQNGIRIVEAECNVTRDHLEEYFGEDRFSCTCHAWSSRGDIKSQPAIVELAYLKKQFVLSPSPTSVEAGSAASLRCSPPAGAPTPRISWLKHGMPLQPDHNVLISAEGNLLITRATQQDMANYSCVAENIAGKRVSEAATLTVYVNGGWSSWGPWTQCRCNGHIAGQRRTRSCTEPHPLNGGAPCQGQSVQKTADCVLCQSASPGRWGVWTEWSVCGSDCRQTRRRSCAGDACSGAQVQHADCVGDYCGVHGVAAHSDIPLYIGIAVAVVVFLAGAAVVYKLLQRKTRDHSLYTMTRTDFQPEIYPTVEKQSLSLAPDLMQHRPPRYEHPQPDPRTEHHYDVPHLTNSYASPVDHQVTPCASSKGQSDEYDSKPYTESEHSASSCFTSSGSMYDAANESVTLQLAELVSNSPISQSLSVSSSGARLALPAAGVALSIPEGAVSRGRREQIYVAVVKDDRYRSRLGKGITQLSPVVKCGPPRLQLNKSVILQIPHCASLKHGFWNLALYAIDHNNAKADTHPQWKKVVSLGQETINTPVFTQLDSEKIYLVTDMLSTFVLVGESFNGKAVKTLQLAIYAPAMITDSCTEFNIRLYIFEDTPCAPYYCQEQEKKLGGVLLERPKTLLFQDGGSNLCLNLEHLSPGWKAKPGIGYQEIPFNHVWGSNYNALHCSFALERTYDCDNIDFTISACQRTNPSYKQTFRILIDDLRSRSPRGRSPRESQRSFNITENLLEARMCRSDEGSDGNLKRNVTVNETGVNECAESPFKLSVRVKRQLCAILDPPNARGNDWRALAARLGVDRYTTWFATKSSPTEAILELWECRERGPGATASLAAALRQMERHDAADLLHGRPSWL-
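The transcript and protein lengths reported in this record can be cross-checked with a little arithmetic. The following is a minimal Python sketch, not part of the database: it assumes the transcript is coding sequence only (it begins with ATG and ends with the stop codon TGA) and that the trailing "-" in the protein sequence marks that stop codon. The helper names `parse_fasta` and `expected_cds_length` are hypothetical, and the demo record is an illustrative fragment rather than the full sequence.

```python
def parse_fasta(text):
    """Return {header: sequence}, joining sequence lines that were wrapped."""
    records = {}
    header = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue  # skip blank lines between records
        if line.startswith(">"):
            header = line[1:]
            records[header] = ""
        elif header is not None:
            records[header] += line
    return records


def expected_cds_length(protein_aa, has_stop=True):
    """Nucleotides needed to encode protein_aa residues, plus an optional stop codon."""
    return 3 * (protein_aa + (1 if has_stop else 0))


# Illustrative fragment only (hypothetical header), wrapped over two lines.
example = ">demo-TA\nATGGCG\nTGGTTATGA\n"
records = parse_fasta(example)

# Lengths stated in the record: 1016 aa protein, 3051 bp transcript.
stated_protein_aa = 1016
stated_transcript_bp = 3051
lengths_agree = expected_cds_length(stated_protein_aa) == stated_transcript_bp
```

Under these assumptions, 3 x (1016 + 1) = 3051, so the stated transcript and protein lengths agree. The same parser can be applied to the full records above, since it concatenates a sequence that has been split across several lines.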