Monarch geneset OGS2.0

DPOGS205240
Transcript: DPOGS205240-TA, 1272 bp
Protein: DPOGS205240-PA, 423 aa
Genomic position: DPSCF300265 + 295373-298534
RNAseq coverage: 25303x (Rank: top 0%)
Annotation
Heliconius: HMEL013458, 1e-124, 51.91%
Bombyx: BGIBMGA008736-TA, 9e-83, 39.15%
Drosophila: Nrg-PG, 6e-59, 36.65%
EBI UniRef50: UniRef50_P31398, 2e-114, 49.16%, Hemolin n=13 Tax=Bombycoidea RepID=HEMO_MANSE
NCBI RefSeq: NP_001037088.1, 9e-99, 43.63%, hemolin [Bombyx mori]
NCBI nr blastp: gi|186703381, 1e-124, 53.70%, hemolin [Heliothis virescens]
NCBI nr blastx: gi|186703381, 1e-123, 54.07%, hemolin [Heliothis virescens]
Group
KEGG pathway: xtr:100124734, 2e-31
    K06550 (L1CAM) maps-> Axon guidance
                          Cell adhesion molecules (CAMs)
InterPro domain:
    [306-418] IPR013783, 2e-21, Immunoglobulin-like fold
    [341-412] IPR003598, 3.6e-16, Immunoglobulin subtype 2
    [328-421] IPR013098, 1e-13, Immunoglobulin I-set
    [335-423] IPR003599, 4.9e-12, Immunoglobulin subtype
Orthology group: MCL25533 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205240-TA
ATGTCGCGATTTTTACCGTATTTTCTGGCTGTATGCGTCGCGGTATGCTCAGGCGCTAAACTGCCTCTGCAAGACGCTCAAATATTAAAGGAGGCGCCGGAGGAAGTCATATTTAGAGCTGGGAAGCCAGAATTAGTTCTGGAGTGTGCTACTGAAGACAAGAGTCAAGCACCCATACGGTACAGCTGGTTTAAAAACGGGCAGCTTTTCGAACCGTCTGGTGAGGTGAGCCAGAGAGATAACGAAGGTACGCTTGTGTTCAAAAATCCTAAGAAAGATGATGAAGGAAAATACCAGTGCTTCGCTCAGACAGAGGCTGGTATAGCCAGCACTAGAATCATTAACGTAAAGCTCGCTTTCATTGACATACCAAAAGTAACACTCCAGAACCACAAACCCATCGAAGGTAAATTATACCAACTGGATTGCGAAATACCGAACAGCTATCCGAAACCGGAAATACAGTGGATCTATCAATCCGTATCCGATCCGAGCATATCAAGGAATATCCTAGACAAACGTATCACGCTATCTCCGAATGGTGTGCTGTATTTTTCTAACGTAACAAAGGAAGATACGAGCGCTGATCATAAATACGTGTGTGTTGCAAAGACACCAGCCCACGATGGTGACGTGGTCTTGGCGGAACACGTCATGGAAGACGTGCAGCCCAGCAGCGGGTCCAACAGCGAGCTGGTGCTCCAATATGTCAGTAACGATATTGTCGGCCAAGTTGGAAAAGTCACTATGATATATTGCATCTATGGCGGAACTCCCCTCGCCCATCCCGACTGGTACAAAGATGGCAAGAACGTAAACAATAGTCCCAAAGACCGCGTCACCAGATATAACCGGACAGCTGGGAAGAGGCTCCTCATCAAGGAGACCTGGTTAGAGGATGAAGGGAACTACACCTGCATCGTGGACAATGAGGTCGGAAAGCCTCAAGAACATACGATCAGCGTTCGCGTCGTCAGTGCACCAGAGTTCACTAAGAAACCGGAGCCGAAAGACAGCACTGTATCCGGCAGGGATGTGACGATACCTTGCCAAGTTGCAGCACTTCCGGTTGCCAAGATAACATGGACTTACAATGCTAAAAGTTTACCTGAAAACGACAAACTTGTGATTTCACAAACAACACAAGGCAATATCACGGTCGCTGATTTGACCATCAAGAATGTCCAAAACTCTGACACCGGATATTACGGATGCAGAGCCACAAATCAACATGGCGATATCTACGCCGAAACACTGCTGATCGTTCAGTAA

Protein sequence:

>DPOGS205240-PA
MSRFLPYFLAVCVAVCSGAKLPLQDAQILKEAPEEVIFRAGKPELVLECATEDKSQAPIRYSWFKNGQLFEPSGEVSQRDNEGTLVFKNPKKDDEGKYQCFAQTEAGIASTRIINVKLAFIDIPKVTLQNHKPIEGKLYQLDCEIPNSYPKPEIQWIYQSVSDPSISRNILDKRITLSPNGVLYFSNVTKEDTSADHKYVCVAKTPAHDGDVVLAEHVMEDVQPSSGSNSELVLQYVSNDIVGQVGKVTMIYCIYGGTPLAHPDWYKDGKNVNNSPKDRVTRYNRTAGKRLLIKETWLEDEGNYTCIVDNEVGKPQEHTISVRVVSAPEFTKKPEPKDSTVSGRDVTIPCQVAALPVAKITWTYNAKSLPENDKLVISQTTQGNITVADLTIKNVQNSDTGYYGCRATNQHGDIYAETLLIVQ-
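
The transcript and protein records above should agree: 1272 bp is 424 codons, that is 423 amino acids plus one stop codon, which this record writes as a trailing "-". The following is a minimal sketch of how that consistency could be checked, assuming the standard genetic code (NCBI translation table 1). The `translate` helper and the `CODON_TABLE` name are illustrative and not part of the database; only the first 30 and last 18 nucleotides of DPOGS205240-TA are used, to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered over T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds, stop_symbol="-"):
    """Translate an in-frame coding sequence (hypothetical helper).

    Stop codons are written as `stop_symbol`; this record uses "-".
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# Lengths taken from the record header: 1272 bp transcript, 423 aa protein.
assert 3 * (423 + 1) == 1272

# First 30 nt and last 18 nt of DPOGS205240-TA, copied from the record.
print(translate("ATGTCGCGATTTTTACCGTATTTTCTGGCT"))  # MSRFLPYFLA
print(translate("CTGCTGATCGTTCAGTAA"))              # LLIVQ-
```

Passing the full nucleotide sequence to `translate` and comparing the result with the DPOGS205240-PA entry would extend the same check to the whole record.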