Monarch geneset OGS2.0

DPOGS208188
TranscriptDPOGS208188-TA3204 bp
ProteinDPOGS208188-PA1067 aa
Genomic positionDPSCF300207 + 254189-274386
RNAseq coverage888x (Rank: top 14%)
Annotation
HeliconiusHMEL0157120.057.67% 
BombyxBGIBMGA010124-TA0.063.29% 
DrosophilaSema-5c-PA1e-17937.09% 
EBI UniRef50UniRef50_F4W9640.039.94%Semaphorin-5B n=6 Tax=Myrmicinae RepID=F4W964_ACREC
NCBI RefSeqXP_394067.20.039.15%PREDICTED: similar to Sema-5c CG5661-PA [Apis mellifera]
NCBI nr blastpgi|3287834020.039.41%PREDICTED: semaphorin-5A [Apis mellifera]
NCBI nr blastxgi|3287834020.039.41%PREDICTED: semaphorin-5A [Apis mellifera]
Group
Gene OntologyGO:00055151.1e-110protein binding
GO:00160201.8e-10membrane
GO:00072751.8e-10multicellular organismal development
GO:00048721.8e-10receptor activity
KEGG pathwayame:4105890.0 
 K06841 (SEMA5)maps-> Axon guidance
InterPro domain[40-473] IPR0159431.1e-110WD40/YVTN repeat-like-containing domain
[24-473] IPR0016271.9e-106Semaphorin/CD100 antigen
[629-690] IPR0008841.1e-14Thrombospondin, type 1 repeat
[474-528] IPR0162011.8e-10Plexin-like fold
[474-521] IPR0036592.7e-06Plexin/semaphorin/integrin
[143-173] IPR0131032.9e-06Reverse transcriptase, RNA-dependent DNA polymerase
Orthology groupMCL11412 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208188-TA
ATGAGTGTAATATACAAAGTGACAGCGATCATCTTGCTTGGAGCTCTCAGCAAGGGGGAGTTACCGGAAGACGAATTTAGGATAATTAATAGACAAGACCTGCTGGCGTCTGATAGCGATGTCTTTGAAGATAACACCAGCAAATCATTTTCACAGCTGTTGTTCGATGTAGCGAGAGACCAAGTTATAGTGGGTGCTAGAGACACGTTATACCGTTTGTCGCTACGCGGTCTCAGGGAACTTGAGCGTGCGAATTGGCCCGCACCAGAGGGTAAGACGAAATTGTGTCAAGACAAGGGTCAGACTGAAGACAGTTGCCGCAATTACATCAAAGTATTGCTCTCTTACGGGCACAAGCTGTTCGCGTGCGGCACAAACGCTTTCAGTCCGATGTGCAGTTGGAGGGAGGTGTGGGCTTATTCCGGTGATAAAAAAATTGTAAACCTAAATAAAGCATTATATGGTCTTAAAAATTCACCCAAGTGTTGGAACACCAAATTTAATTCTGTAATTAGTAGAGAGAGGAAGGGTGGCGCCACTGTGCCTTTGAGTCGTGGCGCTTCGCCTCGTGAGACTGGTATGTTGAGAACAACCCAGTATGACCTGAACTGGCTCAACGAACCGCAGTTCGTGGGCAGTTTTGAAGACGACTACTTCGTATACTTCGTTTTTAGAGAAGTCGCAGTGGAATATCTTAATTGTGGAAAGACAATATACTCTCGTATAGCGCGTGTATGTAAAAATGACTCGGGCGGCCATCTTGTTATGAAAGACAATTGGAGCACCTTCCTCAAGGCACGTCTCAAGTGTCCCGTGCCAGGGGACGTGCCGTTCTACTACGATGAGGTCCAGAGCGTTGAATATTTGGCCCAAGAGAAAATTCTTGTAGCGGTCTTTACTACACCCACAAACAGCATAGCCGGTAGTGCCGTGTGTATATACAACATGTCAGACATACAGATGTCCTTCGAGGGTCAGTATAAGGTTCAAAACTCACCGACCTCTACTTGGGAACCCCTGACACCAACGAGAGAAGCCAGGGAACACTTTAGATGCAACCCTGATCCCAGACCCCATCACAGTATGGAGTACCACAAGTATCATTTAATGTACGAGGCCATACAGCCGATATCGGGCGAGCCGGTGTACAAAGCCATACTAGAAAGATTCACCCACGTCACAGTAGATGTTGTCACGACAAAAAATATCGCCAAGCAATTGGTCGTGTTCGTTGCAACCGAGAACAGTGATGTGCTGAAACTGGCGATACTGCCGCGGTACGAGGGCGCCTGCCTGGTCGAGATATGGAAACTCAAGGACTCTAGAGGCGGATACAACATACAGAACATGCAGTTCGTGAAAGATACGATGTCGCTGTACATCGGCAGCGACACGGGCGTGCTGCGTCTATCGAGCGAGCGCTGCAGCCGCTACAGGAGTCGGGCGGGGTGTGTGGGGGCCGCGGACCCGCACTGCGGCTGGGACGACGGCCGGGAACAGTGTGTGAAGAGTTCACAACACTTGAGAGGAGCTTCCTTTGTGCAGTCCACAGCCAGCTGTCCCGCTGATAACAGTCAAGTGGATGGCGGCTGGTCGTCGTGGTCGGAGTGGGAGCCGTGCATGCAGGATGAGTCCACGCACGTTGTGTACGGAGACGACAAACCCGACATGTGCATGTGTCGCACCAGGAGTTGTGACAACCCTAGACCCGCCAACGGGGGGCAACCCTGTCAAGGTACTTATCCGCTCATTTTTATCGATATTGTGACTGCTCATTTTTATCGATATTCAGATTGCACTATCATCAAACTCAAACATCATCAACCATTGAACGATGAATATAAAACTAGATTTCTATGTAGGCTTCGGTTCGTTAATCCGGCCCTCGCTTCGATAGACGGGTCGTGGTCCCCTTGGGGCCCGTGGTCGGCATGCACCGGCGCAGGTTGTGGTGTGGGCGGGGGGACTCGCGAGCGACGCCGGGTCTGTGGGAGCCCCGCCCCTCGACACGGGGGAGCCGATTGCGAGGGGCCTCGCTTTGAAAGACAGTCTTGTGACCTACGACCATGCGAGGTCAGAAAGGCGACCGCTTGGACCCCGTGGGTGCAAATACCAAGCAACACTTCTGACGGTAGTTACACCGAGAAGAGATTTAAGTTCCTATGTAAGGCGCCCGCACCGGAACAAATCAGGTTATCTCTCGCCCGTGAGGAGGAACGCTATTGTAATCCTCGTGGGGTATGTACTAGTACACCTCCTGAGGAGGATCCGTCGTTTGACGGCTGGGGGCCCTGGGAGGCCTGGGGGGCCTGCTCCGCTGTCTGCGGCGGGGGCCAGCAGCAACGGACACGTCACTGCCGAAGGCCCCCCTGTACGGGGACCGCGGATATGCTGAGGCCATGCAATACGCATGCTTGTCTTGGCGAATGGTCATGCTGGAGTGAGTGGAGCGAGTGTAGCGGTGGATGTGACTCCACCGGCCACCGAACCCGCACCCGTATGTGCGTGTCGCCGCAGGGCTGTGTCGACGCCGGGGCCGCCCTGGAGAGACGCGCCTGTGTCAACACATGCACCGAATCAGAGAGTGGTTGGGGGGCCTGGGGGGCCTGGAGTGAGTGCGAGGGCGGTGAGAGGGTGAGGCGGCGCAGTTGTGAGTCGGGGGCCTGCGTGGGGGCCCAGCTGCAGGCGGCCAAGTGTGGAGACGATGATATGGATAATGAGTTATATGCGATGCCGGCATACAGTCAGAATGTTGAGAGCGCTTCCTTTGTAACAATGTCCAGCGAACCTCTCGGTGTTGGGGGCATTGTTGGCTGTGTCGTCGGAGCTTTTGTTATGGGTTGTCTATTATGTCTGGGGGTGGTGGTGGCGTGTTACCGTCGTCCGTGGAGGTCGGCAGCGCGCGTGCCGTCCAGTCCGCATTACATCACCGCTAAACAGAACAGCTATGTCACAGTGCCGCTTAAAGATGTGCCGCGTAAAGCTAAGCGCCAGCCATCATTCTCGGGTCTTGGCAACAGTAGTGGCATCCTCGTTAAAAGCAATAACTTGTCTAACGCCAACCACAACAACACTATGGCCACCCCCAAACTATATCCCAAGGCCATCGCCAATGAGTACGACTCAATGGGAACATTGCGGAGACATTCCAACCAACCGAACAACAAAACTAATATTGATATTGAAGAGGATAAGTTCTATTGA

Protein sequence:

>DPOGS208188-PA
MSVIYKVTAIILLGALSKGELPEDEFRIINRQDLLASDSDVFEDNTSKSFSQLLFDVARDQVIVGARDTLYRLSLRGLRELERANWPAPEGKTKLCQDKGQTEDSCRNYIKVLLSYGHKLFACGTNAFSPMCSWREVWAYSGDKKIVNLNKALYGLKNSPKCWNTKFNSVISRERKGGATVPLSRGASPRETGMLRTTQYDLNWLNEPQFVGSFEDDYFVYFVFREVAVEYLNCGKTIYSRIARVCKNDSGGHLVMKDNWSTFLKARLKCPVPGDVPFYYDEVQSVEYLAQEKILVAVFTTPTNSIAGSAVCIYNMSDIQMSFEGQYKVQNSPTSTWEPLTPTREAREHFRCNPDPRPHHSMEYHKYHLMYEAIQPISGEPVYKAILERFTHVTVDVVTTKNIAKQLVVFVATENSDVLKLAILPRYEGACLVEIWKLKDSRGGYNIQNMQFVKDTMSLYIGSDTGVLRLSSERCSRYRSRAGCVGAADPHCGWDDGREQCVKSSQHLRGASFVQSTASCPADNSQVDGGWSSWSEWEPCMQDESTHVVYGDDKPDMCMCRTRSCDNPRPANGGQPCQGTYPLIFIDIVTAHFYRYSDCTIIKLKHHQPLNDEYKTRFLCRLRFVNPALASIDGSWSPWGPWSACTGAGCGVGGGTRERRRVCGSPAPRHGGADCEGPRFERQSCDLRPCEVRKATAWTPWVQIPSNTSDGSYTEKRFKFLCKAPAPEQIRLSLAREEERYCNPRGVCTSTPPEEDPSFDGWGPWEAWGACSAVCGGGQQQRTRHCRRPPCTGTADMLRPCNTHACLGEWSCWSEWSECSGGCDSTGHRTRTRMCVSPQGCVDAGAALERRACVNTCTESESGWGAWGAWSECEGGERVRRRSCESGACVGAQLQAAKCGDDDMDNELYAMPAYSQNVESASFVTMSSEPLGVGGIVGCVVGAFVMGCLLCLGVVVACYRRPWRSAARVPSSPHYITAKQNSYVTVPLKDVPRKAKRQPSFSGLGNSSGILVKSNNLSNANHNNTMATPKLYPKAIANEYDSMGTLRRHSNQPNNKTNIDIEEDKFY-