Monarch geneset OGS2.0

DPOGS203991
TranscriptDPOGS203991-TA2622 bp
ProteinDPOGS203991-PA873 aa
Genomic positionDPSCF300005 + 1304026-1306647
RNAseq coverage54x (Rank: top 69%)
Annotation
HeliconiusHMEL0072280.056.80% 
BombyxBGIBMGA002136-TA0.052.35% 
DrosophilaSema-5c-PA3e-1831.90% 
EBI UniRef50UniRef50_UPI00021A6E2A3e-4630.58%UPI00021A6E2A related cluster n=4 Tax=unknown RepID=UPI00021A6E2A
NCBI RefSeqXP_001661750.11e-2940.00%cell adhesion molecule [Aedes aegypti]
NCBI nr blastpgi|3407158961e-4530.58%PREDICTED: SCO-spondin-like [Bombus terrestris]
NCBI nr blastxgi|3407158962e-4630.58%PREDICTED: SCO-spondin-like [Bombus terrestris]
Group
KEGG pathwaydre:5612954e-19 
 K06841 (SEMA5)maps-> Axon guidance
InterPro domain[100-156] IPR0008842.4e-15Thrombospondin, type 1 repeat
[154-306] IPR0089851.2e-14Concanavalin A-like lectin/glucanase
[346-528] IPR0133201.6e-09Concanavalin A-like lectin/glucanase, subgroup
Orthology groupMCL26563 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203991-TA
ATGAGACGCTGTGATACACCGCCCCCCTCTCTATCGCATTTGATTTGTCCAGGAACGCCACTGCAATACGAACAATGTGAAGGAGATCAATGTGCGATCGACGGTCAATACGTTGAGCAAAGTGGCAGTTGGAGCGAATGGGGTGCCTGGACTGAAAGTTCAGAGAAATGTGGCTATGGAGTTCGTCGTAGGAAAAGGGCTTGTGTCGAGAAGCAACTAGCACTTTCTGCTATAAATTGGGGAACACATTGTAGAGGACAGTATGACGAATTAGATGTTTACTATAACACAGAATGCGTTTTGGATGGAGGCTGGTCTGGTTGGGGACCATGGGGGCCGTGTTCTCAAACATGTGGCGCTGGCAGACGTTCAAGAACTAGATCCTGTACAAGACCAATACCTTCGGGCAATGGTACCGATTGTGTGGGACCAAAATCTGACGTTGGGACATGTCATTTAGCGCCTTGTGAAGTTTTTACACACACTATTTCTTTACTTAATGGTGACTCTTATATGCACTATAATTTCCCACGCAAAAGATCAACCTTCTTTCATTTTTATATCCGTTTTATGGCACTTTCTCCCCACGGAATCATTGTTCGACGTGGAAGTGCTCAAAATCCAAGTGTTCGATTAAGTTTGCAGAAGTGGCATGTTTGTCTAGACGCTAGCGGTTTGTCCAAATCTTGCAGTTTACCTCGTATGTGTTCAACAACAGCAATCGAGCCTGCAACATGGCATTCAATCTTAATGTCTGTTACGAGCCAAGTAGCTATCATAAGATTAAATGACGCCCAAATATCAATGCAGAACTGGTTTCCTTGCAATCCTGAATTAGAAAATGACAAAATGAATATTTTTATTGGAGAAAAGTTTCATGGCGAAATTCATGAAATTATGCTTAATTTTATTCCATTACATACCATTATTGAACGGGAACAGCGAATGAGTCAATCAGATTTCTATCCTATTTCAACGTCTAACATGGCCTATGAAAAAGCTAGCCCTGAAGAAGCTTATTTACTAATACAAAACGATCAATATTTACGTTTGCCTTGTTTTAAGGAACAAGATGAATGGCAAATTGAATTGACAATTAAATCTGAAAAAGAAAGTGGTACAATAATTTTACTTCCAAATAACATAAACGATAACTGGTTGTATGTAATTTTACAAAATATGAGATTAAAAATAAAATTTGCTCGAAAGGAATTCAAGTCGGAAGCAATTAGTTCAACTGAATTTTTACCTGATCAATGGATGGATATAGTTATAAGTAAAAAAAGAGAAACCAATACTATTGTAGTATCAATAAACGCAGGAGAACGTCTTCACGTACTTTTGATAGAAGAAACAAAAAAAATTGGAAAATCACGTATCAATAACAAACATGTACTTCAGGGCCAAAATTATAATCACAGTTTCATAAACAATAACAAGATTGTAATATGCAATGATGAATTTTTTATTGGCGGCGTACCGCTTGATATAAAAAATTCTATATCAGAAGATTTTACACCTTTTACGGGAATAGCAGCCTCAGTAAGGTTAAATAATAATTTGTTAGATTTACACGATTTCAGTATGGAGCGGACCAAAAACGACCTAATTCAACTCTCTTCAAGAACTGCTAGTATTTCAGGATCATATCATGAAACCGAATGGGGTGAATCAAACCAGTTCAATTTAACATGTCTACATGCCCGAACAGCAAGTTTACCACATTCAGCCTATTGGCTTTATTGGGATACCCAAATAAAAATCATAAAAAGTAAAAATGCACGTTCCGTAGATGATGGAAGAGTTTTACGTTTGTTGGTAACAGCTGAAAAAGACCTTAGGGGATATTATACTTGTAGAGCACATAGTAATAGACGCACAAGGAACATCGTTACGTTCGGCGTTTTAGGAAAAATACAATATAAAAGTTTGAGCCCCGACACGTTAACTGTGATTGCAATTTGCACTACACTGTCTTTAGTAATATTTACTTTGGCCTGGCTTATAATAGAAGGATATCACGACCTTCGTAATGGTTATGGATTTTTTAGAGACGCCCATCTTTCCCCCGAAGAGGAGGCGGAAGTGGTTTGTCAATATTTCGAACAAAACATGCACTTACTTGGATCGCAAAGTGAGGTTAATCTAGCTAAGACGAAAGCAAGGCGCAGAGGTAAGCGGTTGGCTAGTAAAGCAAGTTTTGGAGTACAGGAACCAGATAACATGTTAGAAGGAAATAATGCACTAGAAGAATTCACTTCTAGCGATCCCGAGGGTTTACCTACTTTACCTGAGATAAAAAATTCTGGCATAGAATTTTTTCATAAAATTTATAGATATGAACCTTCCTACGTTAGTTCTCCCCGCCATGGCTCCCTTACCACCCGAACAAAACTATCTTCAACCTCGTCTTTAGACTCACTTGCAAAAGTGTTAGGTTCCCCCTCCTATGTACGTAAGATCGCTAATTTATCTAAAGATAATAAAAGGATTAAAAACTGCCGCTTTAAGAAAGCCAAAAATGAATCAAATCTACTGACCATAAAGTCTTCAACGTTTCTCAAAAAATCACCGGCACATAAGGTGTTGGAAAAATTTCAGGAATTAAAGAGCGATGATTAA

Protein sequence:

>DPOGS203991-PA
MRRCDTPPPSLSHLICPGTPLQYEQCEGDQCAIDGQYVEQSGSWSEWGAWTESSEKCGYGVRRRKRACVEKQLALSAINWGTHCRGQYDELDVYYNTECVLDGGWSGWGPWGPCSQTCGAGRRSRTRSCTRPIPSGNGTDCVGPKSDVGTCHLAPCEVFTHTISLLNGDSYMHYNFPRKRSTFFHFYIRFMALSPHGIIVRRGSAQNPSVRLSLQKWHVCLDASGLSKSCSLPRMCSTTAIEPATWHSILMSVTSQVAIIRLNDAQISMQNWFPCNPELENDKMNIFIGEKFHGEIHEIMLNFIPLHTIIEREQRMSQSDFYPISTSNMAYEKASPEEAYLLIQNDQYLRLPCFKEQDEWQIELTIKSEKESGTIILLPNNINDNWLYVILQNMRLKIKFARKEFKSEAISSTEFLPDQWMDIVISKKRETNTIVVSINAGERLHVLLIEETKKIGKSRINNKHVLQGQNYNHSFINNNKIVICNDEFFIGGVPLDIKNSISEDFTPFTGIAASVRLNNNLLDLHDFSMERTKNDLIQLSSRTASISGSYHETEWGESNQFNLTCLHARTASLPHSAYWLYWDTQIKIIKSKNARSVDDGRVLRLLVTAEKDLRGYYTCRAHSNRRTRNIVTFGVLGKIQYKSLSPDTLTVIAICTTLSLVIFTLAWLIIEGYHDLRNGYGFFRDAHLSPEEEAEVVCQYFEQNMHLLGSQSEVNLAKTKARRRGKRLASKASFGVQEPDNMLEGNNALEEFTSSDPEGLPTLPEIKNSGIEFFHKIYRYEPSYVSSPRHGSLTTRTKLSSTSSLDSLAKVLGSPSYVRKIANLSKDNKRIKNCRFKKAKNESNLLTIKSSTFLKKSPAHKVLEKFQELKSDD-