Monarch geneset OGS2.0

DPOGS214072
Transcript: DPOGS214072-TA  1305 bp
Protein: DPOGS214072-PA  434 aa
Genomic position: DPSCF300171 + 331984-335156
RNAseq coverage: 387x (Rank: top 31%)
Annotation
Heliconius: HMEL012870  5e-179  88.35%
Bombyx: BGIBMGA010384-TA  1e-152  78.34%
Drosophila: dpp-PA  6e-89  46.95%
EBI UniRef50: UniRef50_B8YPW1  8e-146  77.24%  DPP n=2 Tax=Obtectomera RepID=B8YPW1_BOMMO
NCBI RefSeq: NP_001138801.1  1e-146  77.24%  decapentaplegic [Bombyx mori]
NCBI nr blastp: gi|223890172  3e-145  77.24%  decapentaplegic precursor [Bombyx mori]
NCBI nr blastx: gi|223890172  3e-170  79.13%  decapentaplegic precursor [Bombyx mori]
Group
Gene Ontology: GO:0008083  3.6e-66  growth factor activity
               GO:0040007  5.4e-29  growth
KEGG pathway: aag:AaeL_AAEL001876  9e-93
 K04662 (BMP2_4) maps-> Basal cell carcinoma
    Pathways in cancer
    Cytokine-cytokine receptor interaction
    TGF-beta signaling pathway
    Hedgehog signaling pathway
InterPro domain: [97-434]  IPR015615  4.3e-144  Transforming growth factor-beta-related
                 [333-434]  IPR001839  3.6e-66  Transforming growth factor-beta, C-terminal
                 [92-291]  IPR001111  5.4e-29  Transforming growth factor-beta, N-terminal
Orthology group: MCL13956  Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214072-TA
ATGGTAGGCTCAGTGGTATCAGGGGCGGGGGGCGGGGCGATCACGGTGATCACGATCACGACCACGACGCCCACGACGCCCACGACGCCGCCGCGTCTGGAACAGCCACGGCCTCTCATTCCGACCGCGGACGCGAAGCCGCGTGCACGTCATGAAAGTCACACCTGGAGGCGTCACGATCGTGGGTCGAGGAGAATTATGCGTGGGGCGTGCGCGTGCGCGGTGGTGTGCGCGTTGGTGGCGTTGTGCGCGGCTGCGGGGCTGGACGAAGGGACCCGCGCCGCCGCCGAGAGGCAGCTGCTGGCACTCCTGGGGCTGCCCCGTAGGCCGCCGCACACACACCGCCCTCCGCCGCCCGTGCCTCAAGCCATGCGACTCCTATACGACGAGAGCGCCATCCCAGCGGCAGCCGCCAACACGGCTCGCTCTTTCTACCACACTCCCACCGAGCTGGACGATCGCTTCCCTGGCGAGCACCGGTTCCGACTATACTTTAACATAAGTGGCGTGCCGGCTGATGAGATAGCGAGGGGTGCGGACCTCACGTTCCAGAGGGCTGTTGGAACGACCGGCAATCAGAGACTGTTATTATACGACGTCGTTAGACCGGGACGTAAGGGTAAGAGCGAGCCCATACTGAGGTTACTAGATTCGATACCACTCAGAGTTGCCGAGGGTACGGTGAATGCGGACGCTCTCGGTGCTGCGCGGAGATGGCTCAAAGAGCCTAAACATAATCATGGACTTCTAGTGCGTGTTTTAGAAGAGGGATCGGGTAGTACGGACGCTAAGTTTCCACATGTGCGTGTAAGGAGACGTGCGACGGACGCTGAGGAGGAATGGCGATCTTTACAGCCATTATTGATGCTCTACACGGAGGATGAGAGAGCGAGGGCTGCCAGAGAACGTGGAGAAACAAAGTTGACAAGAAACAAGCGTGCGGCTCAGAGAAGAGGCCACCGCGCGCATCACAGACGTAAGGAAGCGAGAGAAATATGCCAGCGTCGGCCTCTGTTCGTGGATTTCGCGGACGTTGGCTGGAGCGATTGGATCGTCGCACCTCACGGCTACGACGCATACTATTGCCAAGGCGACTGCCCTTTTCCGCTAGCGGATCATCTGAACGGCACGAATCATGCGATCGTGCAAACTCTGGTCAACTCAGTGAACCCTGCGGCGGTGCCCAAAGCGTGCTGCGTTCCCACGCAACTCTCCTCTATATCTATGTTATATATGGACGAAGTGAACAATGTGGTGCTTAAAAATTATCAAGATATGATGGTTGTGGGTTGTGGTTGCCGATGA

Protein sequence:

>DPOGS214072-PA
MVGSVVSGAGGGAITVITITTTTPTTPTTPPRLEQPRPLIPTADAKPRARHESHTWRRHDRGSRRIMRGACACAVVCALVALCAAAGLDEGTRAAAERQLLALLGLPRRPPHTHRPPPPVPQAMRLLYDESAIPAAAANTARSFYHTPTELDDRFPGEHRFRLYFNISGVPADEIARGADLTFQRAVGTTGNQRLLLYDVVRPGRKGKSEPILRLLDSIPLRVAEGTVNADALGAARRWLKEPKHNHGLLVRVLEEGSGSTDAKFPHVRVRRRATDAEEEWRSLQPLLMLYTEDERARAARERGETKLTRNKRAAQRRGHRAHHRRKEAREICQRRPLFVDFADVGWSDWIVAPHGYDAYYCQGDCPFPLADHLNGTNHAIVQTLVNSVNPAAVPKACCVPTQLSSISMLYMDEVNNVVLKNYQDMMVVGCGCR-